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Abstract 

For d > 3, we construct a non-randomized, fair and translation- 
equivariant allocation of Lebesgue measure to the points of a standard 
Poisson point process in M. d , denned by allocating to each of the Pois- 
son points its basin of attraction with respect to the flow induced by 
a gravitational force field exerted by the points of the Poisson pro- 
cess. We prove that this allocation rule is economical in the sense 
that the allocation diameter, defined as the diameter X of the basin 
of attraction containing the origin, is a random variable with a rapidly 
decaying tail. Specifically, we have the tail bound 

F(X > R) < C exp [ - ci?(log R) ad 

for all R > 2, where: ad = for d > 4; «3 can be taken as any 
number < —4/3; and C, c are positive constants that depend on d and 
ad- This is the first construction of an allocation rule of Lebesgue 
measure to a Poisson point process with subpolynomial decay of the 
tail F(X > R). 
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1 Introduction 



Let d G N. Let H be a discrete subset of M d . We call the elements of S 
centers. An allocation (of Lebesgue measure to H) is a measurable function 
tfj : M d — > E U {00} that satisfies 

Vol(V> -1 (oo)) = 0, 
Vol^" 1 ^)) = 1, ze~, 

where Vol( • ) is Lebesgue measure in M. d . For z G H, we call ?/> _1 (z) the cell 
allocated to 2. In other words, an allocation is a way of partitioning M. d into 
cells of Lebesgue measure 1 that together cover M. d up to a set of measure 0, 
and assigning them to the points of S. 

Let Z be a translation-invariant simple point process in M. d with unit 
intensity defined on some probability space (Q, P). That is, Z is a random 
discrete subset of M d such that for any open set A C M. d , the random variable 
\A fl Z| (where \E\ denotes the cardinality of a set E) has mean Vol(A), 
and for any x G M d and open sets Ai, A 2 , . . . , A^ C M d , the random vector 
(|(Ai+x)nZ|, |(y4 2 +x)nZ|, . . . , |(Afc+x)nZ|) has distribution that does not 
depend on x. An allocation rule (of Lebesgue measure to Z) is a mapping 
Z — >• that is defined P-a.s., measurable (with respect to the relevant a- 
algebras), and such that: (i) almost surely ipz is an allocation of Lebesgue 
measure to Z, and (ii) the mapping Z — ► %pz is translation-equivariant, in 
the sense that P-a.s., for any x,y G M. d we have 

ipz+x(y + x) = ?Pz(y) + x- 

Figure [T] shows a particularly important example of an allocation rule 
that gave much of the inspiration for the current paper - see below. 

An allocation rule Z — > ipz may satisfy several additional desirable prop- 
erties: each cell ip^i^z) may be open, may contain its "owner" z; each cell 
may be connected; each cell may be bounded. In the event that a.s. 
all the cells are bounded, one may consider the allocation diameter, which 
is the random variable 

X = diam^^O))), 

where diam( ■ ) denotes the diameter of a set. The rate of decay of the tail 
P(A > R) of the distribution of X as R — > 00 can be used as a quantitative 
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Figure 1: The Nazarov-Sodin-Tsirelson-Volberg gradient flow allocation, 
equivalent to 2-dimensional gravitational allocation. It can be denned for 
a finite point set or for the process of zeros of the Gaussian Entire Func- 
tion. For the Poisson point process we construct the analogous allocation in 
dimensions 3 and higher. (Picture due to Manjunath Krishnapur). 



measure for how economical the allocation rule is; roughly, a fast rate of 
decay means that it is rarer for points to be allocated to a far-away location. 
Note that by translation-equivariance one may take the diameter of the cell 
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containing any given point x G W 1 and get a random variable 
equal in distribution to X. 

Holroyd and Peres [8] showed that if d = 1,2 and Z is a standard Poisson 
point process of unit intensity in M. d , then for every allocation rule the allo- 
cation diameter X satisfies ~EX d / 2 = oo. In particular, in this case the decay 
of ¥(X > R) to cannot be faster than polynomial in R. Hoffman, Holroyd 
and Peres [TU] constructed an allocation rule for every translation-invariant 
point proces in M. d with unit intensity, the stable marriage allocation, in 
every dimension d > 1. In the stable marriage allocation, almost surely all 
the cells are open, bounded and contain their owners, but not all are con- 
nected, and when Z is a Poisson point process the allocation diameter X 
satisfies KX d = oo. The stable marriage allocation rule is the unique one 
which has the "stability" property that almost surely, for any z, z' G Z and 
x,y G R d , if ipz{%) = z and ipz(y) = z ' then \x — z'\ > \x — z\ A \y — z'\. 

Nazarov, Sodin and Volberg [TJ] recently constructed an allocation rule 
based on an idea suggested by Tsirelson in [15J . Their allocation rule is 
defined for the two-dimensional point process X of zeros of the Gaussian 
Entire Function (GEF), which is the random analytic function 



„_n yn. 



n=0 

where (£ n )^L are i.i.d. standard complex gaussian random variables. In their 
construction, the cell of each z G X is defined as the basin of attraction 
of z with respect to the flow induced by the random planar vector field 
z — > (V log |/|) (z) — z. The cells are connected by definition, and in [TJ] 
it was proved that they are a.s. bounded, each have area 7r (which is the 
reciprocal of the mean density of points in the process of zeros of the GEF), 
and that there exist absolute constants C, c > such that the allocation 
diameter X satisfies 

ce -Cmo g R)^ < p(x > R) < Ce -cR(lo g Rf/^ R> 1. 

Figure Q] shows a simulation of the Nazarov-Sodin-Tsirelson- Volberg gra- 
dient flow allocation. Figure [2] shows the graph of the potential function 
log | y | associated with the allocation (where / is an approximation to the 
GEF). Figure [3] shows a simulation of the stable marriage allocation in 2 
dimensions. 
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Figure 2: The potential function associated with planar gravitational alloca- 
tion (picture due to Manjunath Krishnapur). 

In this paper, we construct a new allocation rule of Lebesgue measure 
to the points of the standard Poisson point process in M d , for any d > 3. 
Our construction was inspired by the gradient flow allocation, and we call it 
gravitational allocation. To define it, denote by Z the standard Poisson 
process in W d . Consider the random vector field F : IR d — ► WL d defined by 



where the summands are arranged in order of increasing distance from x. 
The term (z — x)/\z — x\ d represents a gravitational force felt by a unit mass 
at a point x due to the influence of a unit mass placed at point z. When 
d = 3, this is the ordinary Newtonian gravitational force. An elementary 
observation that can be traced back to Chandrasekhar [3] (see also [6]), based 
on the Kolmogorov three-series theorem, is that for any fixed x G M. d , the 
infinite series for F(x) converges almost surely (this is true for d > 3). The 




ZdZ, \z—%\\ 



(1) 
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Figure 3: The two-dimensional stable marriage allocation for a Poisson pro- 
cess (picture due to Alexander E. Holroyd). 

random vector F(x) has a symmetric stable distribution of index -A^-. This 
can be seen using a simple scaling argument (see Remark (iv) below), or by 
an exact computation, see j6]. 

We prove the following result concerning the process of gravitational 
forces acting simultaneously on all points of lR d : 

Proposition 1 (Simultaneous convergence and differentiability). Assume 
d>3. Almost surely, the series in converges simultaneously for all x for 
which it is defined (namely, allx G WL d \Z) and defines a translation-invariant 
(in distribution) vector-valued random function. The random function F is 
almost surely continuously differentiable where it is defined. 

Note that since the sum in ([1]) does not converge absolutely, the choice of 
the order of summation is essential for Proposition [1] and the results below 
to hold. 

Consider now the integral curves Y(t) of the vector field F, that is, solu- 
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tions of the equation 

Y(t) = F(Y(t)). 

We call these curves the gravitational flow curves (in a simplified inertia- 
less Newtonian gravitational world). For x G denote by Y x the integral 
curve with initial condition 1^(0) = x. By Proposition [1] and standard ODE 
existence and uniqueness theorems, Y x is defined up to some maximal positive 
time t x (where possibly t x = oo). For each center z G Z, say that the curve 
Y x ends at z if lim^ Y x (t) = z, and define the basin of attraction of z 
by 

B{z) = {x G R d \ Z | Y x (t) ends at z} U {z}. 

Define 

z x G B(z) for z G Z, 

00 x^\J zeZ B(z). 

Our main result is the following theorem. 



Theorem 2 (Fairness and efficiency of the allocation). The mapping Z — > 
ipz is an allocation rule of Lebesgue measure to the Poisson point process 
Z. Almost surely all the cells ip^^z) are bounded. The allocation diameter 
X = diam(ip2 (ipz(0))) satisfies the following tail bounds: In dimensions 4 
and higher, we have 



F(X > R) < Ci exp - c 2 R(hg R) 



d— 2 
d 



(2) 



for some constants C\ = Ci(d),C2 = 02(d) > and all R > 2. In dimension 
3, for any a > there exist constants C\ = Ci(a), c 2 = 02(a) > (depending 
on a) such that for all R> 2 we have 



P(X > R) < G x exp 



R 



C'2 



(logi2)T 



(3) 



Note that the cells in gravitational allocation are open, connected and 
contain their owners. They are also contractible, see Remark (v) below. 
Figure H] shows a simulation of a cell in 3-dimensional gravitational allocation. 

For L > and x G M. d denote by Q(x, L) the box x + [— L, L] d . A main 
ingredient in the proof of Theorem 2 is the following result. 
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Figure 4: Simulation of a cell in 3-dimensional gravitational allocation 



Theorem 3 (Bounds for the probability of an i?-crossing) . Let Er denote 
the event that there exists an integral curve Y(t) connecting dQ(0,R) and 
dQ(0,2R) (in either order). Then, if d > 4 then we have 



P(£fl)<C ie xp -c 2 R(logR) 



d-2 
d 



for some constants C% = C\(d),C2 = 02(d) > and all R> 2. In dimension 
3, for any a > there exist constants C\ = C\(a), c 2 = 02(a) > (depending 
on a) such that for all R> 2 we have 



HE R ) <C ie xp 



R 



02- 



(logR)3 +l 

In a forthcoming paper [4J, we will prove lower bounds for the tail of 
the distribution of the allocation diameter X, and additional bounds on the 
distance |^(0)| of the origin from its star. 
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Further remarks, (i) Allocation rules have an equivalent description as 
non-randomized extra head rules. If Z is a translation- invariant simple 
point process of unit intensity in M. d , an extra head rule for Z is a random 
variable T coupled with Z so that a.s. T G Z and the random set Z — T 
has the same distribution as Z conditioned to have a point at 0. The extra 
head rule is said to be non-randomized if T is measurable with respect to Z. 
In [S] it was shown that if ipz is an allocation rule then T = T z = ipzify is 
a non-randomized extra head rule, and conversely, given a non-randomized 
extra head rule Tz, the mapping ^z(x) = Tz- X is an allocation rule. 

(ii) For any u G M. d one may replace the vector field F(x) by F(x) + u and 
obtain a modified allocation rule. Thus, there is more than one possible 
construction of an allocation rule involving the gravitational field, and one 
might speculate that a suitable modification of the construction might lead 
to better tail bounds for the allocation diameter. 

(iii) For some results on the related topic of translation- invariant perfect 
matchings for point processes, see [7]. For related results on matchings be- 
tween random point configurations in a finite setting, see the papers [U [T2J 

EE]. 

(iv) Here is a simple argument proving that for fixed x the force vector F(x) 
has a stable distribution with scaling exponent d/(d — 1) (we believe this 
argument is known but could not locate a reference; the proof of this fact in 
[B] uses explicit computations and is more complicated). If Fx, F 2 , . . . ,F n are 
i.i.d. copies of F(x), then their sum is the force exerted on x by the union 
of n independent copies of the Poisson process, which is a Poisson process 
with intensity n (or equivalently a Poisson process of unit intensity scaled by 
n -V*). Thus, because the individual force terms scale as the (d— l)th power 
of the distance, by rescaling it follows that Fx + F 2 + . . . + F n has the same 
law as n ( - d ~ 1 ^ d F(x), which proves our claim. 

(v) Another interesting property of gravitational allocation is that the cells 
are contractible. This is immediate from their definition as the basins of 
attraction with respect to the flow of the vector field F. Formally, denote 
by ($t)t>o the flow semigroup of the vector field F, and for each x G M. d 
denote by r x the time for x to flow to its star ipz{%) (that is, the maximal 
time for which the curve Y x is defined, or for the star). Then if z G Z and 
B(z) is its basin of attraction, the mapping ip : B(z) x [0, 1] — > B(z) defined 
by <f(x,t) = <& Txt (x) is a homotopy between the identity map id B ^ and the 
constant mapping B(z) —>■ z. (Note that almost surely, for all x G B(z) \ {z} 
we have t x < oo, since by definition we have that \im^ Tx Y x (t) = z, and 
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F(u) = (z — u)/\z — u\ d + 0(l) when u — > z, so that once the flow curve Y x (t) 
approaches z, it must reach z in a finite time.) 

A reading guide. Here is a guide to reading the rest of the paper. Section 
H] introduces some notation and recalls some standard estimates for Poisson 
random variables. In section [3] we give outlines of the proofs of the main 
claims, which we hope will give the reader a higher-level picture of the ideas 
in the paper and will simplify reading the more technical later sections. In 
section H] we show how the main result, Theorem [21 can be deduced fairly 
easily from Theorem 3 (bounds for the probability of an i?-crossing) . These 
sections are easy to read and we recommend starting with them. 

The remaining sections constitute the main technical parts of the paper. 
Sections El [7] and [8] are the "preparation" part: In Section [5] we prove 
Proposition [1] (simultaneous convergence and differentiability of the force) 
and Proposition (a useful alternative formula for the force, see Section [3] 
below). Section [7] contains a similar but slightly more difficult analysis for the 
gravitational potential function, an auxiliary function that is defined only 
in dimensions 5 and higher. In Section [6] we prove an important auxiliary 
theorem bounding the joint density of a vector of forces. In Section[8]we prove 
large deviations results that will be used repeatedly as the main "engine" in 
the proof of Theorem [31 

Finally, Sections [9] and [10] contain the proof of Theorem 3. The proof is 
split into two parts. The first and slightly simpler case is the proof in di- 
mensions 5 and higher. The last section, Section [101 treats the more delicate 
case of dimensions 3 and 4. We recommend to the reader who is mainly 
interested in our main result to only skim through the results in Sections El 
[61 [7J and [8] and to proceed to Section [91 However, we believe the results in 
these auxiliary sections to be of significant independent interest. 

Acknowledgements We are grateful to Manjunath Krishnapur, Misha 
Sodin and Fedor Nazarov for helpful comments and discussions, and to an 
anonymous referee for suggesting innumerable valuable comments and cor- 
rections, including a shorter and more elegant proof of the uniqueness of 
solution of a system of equations that forms part of the proof of Theorem [TU1 
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2 Preliminaries 



Here is some notation that we will use throughout the paper: d denotes the 
dimension, and will always be an integer > 3 (in some theorems it will be 
assumed explicitly that d > 5 or that d = 4 or that d — 3). We denote by 
\x\ the Euclidean norm of a vector. We denote Lebesgue measure in M. d by 
Vol( • ) . If V = (V\, Vk) is a random vector, we denote by Var(V) the sum 
of the variances of its coordinates. Let = n d / 2 /T(d/2 + 1) be the volume 
of the unit ball in M. d . Denote by Z the Poisson process of unit intensity on 
R d , and by P the probability measure on the probability space on which it is 
defined. For concreteness we denote Z = {zi)°? =1 for the specific ordering of 
the points of Z by increasing distance from 0. For obvious reasons we refer 
to the Zi as stars. The letters C, c (possibly with subscripts) will be used 
to denote various positive constants that may depend on the dimension d, 
where C will typically be a large positive constant and c will typically be a 
small positive constant, and the same symbols (such as Ci, etc.) may be used 
in different places with different numerical values. Big-0 notation will be 
used, and it is understood that all constants implicit therein may depend on 
d (and occasionally on other parameters that are kept constant throughout 
the discussion). We denote by B(x,L) the ball of radius L around x £ M d , 
and by Q(x, L) the box x + [-L, L] d . This notation and other notations that 
are used frequently in the paper are summarized in Table 1, which may be 
used for reference. 

Lastly, the following lemma gathers some standard deviations estimates 
on Poisson random variables; see [9] for more details. 

Lemma 4. Let X be a Poisson random variable with mean A. Then: 

(i) Ift > 2A then 

F(X >t)< (TiM!). 

(ii) There exists a 5 > such that for all t G [0, 8] we have 

F{\X-X\ > t\) < 2e~ A * 2/3 . 

Proof. For t>X set s = t/X in the inequality s*P(X >t)< E(s x ) = e A(s_1) , 
to get 

nx >t)< e - tlo <i) +t - x = e - A (i 1 °s(!H+ 1 ). (4) 
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Table 1: Summary of the main notation used in the paper 



Symbol 


Sections 


Meaning 


d 


all 


The dimension, an integer > 3. In Section 5, d > 5. 


B(Xj t) 


ii 

all 


Ball of radius t around x E Mr. 


Q(x,t) 


1 1 

all 


The cube x + |— t, t\ a . 




all 


3, j 2 _ pjl * j 1 1 1 * TTTi f\ 


r(d/2+i) = volume of the unit bail m Mr. 


7 _ ( ~ \oo 
Z> — \Zi)i=\ 


all 

an 


i ne siais . a siaiiQaiu. iroisson pomi piocess m ir^. . 


Fix) 


all 


The random gravitational force field induced by Z. 




1, 3, 9 


The event of a gravitational flow curve crossing be- 
tween dQ(0,R) and dQ(0,2R). 


g(x) 


5, 6, 8 


9{x) = #• 


Dk{-\ 


5, 6, 7, 8, 9 


The /c-th derivative tensor of a function. 


R 


all 


The main parameter. 


B 


9, 10 


R^ (In Section |10.4[ ^^). 


A 


9, 10 


A large constant. 


r 


9, 10 


A • (\ogR) 2 ' d (In Section HM (log R) 1 ^ log logi2). 


P 


9, 10 


R-V 10 (In Section |10.4[ tj-^). 
i 


s 


9, 10 


R io(^+i) (i n Section 11041 W)- 


c~ 
c 


9 1 


-2, lncr (4n Sprtion ll 41 

s d lu £, A ^111 OeCUOll l±JJ^±J p-j. 

2 (^) (logi?) 1 ^ (In Section|10.4| Vloglog.R). 


A 


9, 10 


U(x) 


7, 8,9 


The stationary centered gravitational potential. 


U(x | A) 


7, 8, 9, 10 


Centered contribution to the potential from stars in 
the set A. 


F(x | A) 


8, 9, 10 


Contribution to the force from stars in A. 


S 


9, 10 


The grid S = rZ n (Q(0, 2R) \ Q(0, R)). 


S w 


9, 10 


For w E S, the subgrid sZ fl (Q(w, 2r) \ r)). 


T 


9, 10 


For w E S, the subgrid pZ fl (Q(w, 2r) \ r)). 




9, 10 


Global atypical events with negligible probability. 




9 


Local atypical event. 


^5,W 


9 


The event will hold for more than half of W. 


&>6,W 


9 


The event that more than half of W is percolating. 




9 


The event that for all w E W there are many black 
and not 4-crowded points in T w and fl^nfll w oc- 
curred. 


U diS (x, y) 


10 


The centered potential difference function. 


U d[S (x, y\A) 


10 


Contribution to potential difference from stars in A. 



13 



Since if u := | > 2, the inequality |wlogu > u — 1 can be seen to hold, we 

get in that case that ¥(X > t) < e~ jtlos ( J \ proving (i). 

To prove (ii), note that by the same method, if < t < A one can set 
s = t/X in the inequality F(X < t) < s"*E(s x ) to obtain 

F(X <t) <e~ x d l <^-i +1 ). (5) 
Now, from (j3J) we get using a second order Taylor approximation that 

P(X - A > tX) < e -M(i+t)iog(i+t)-t) < e -At 2 /3 

for t e [0,5]. A similar bound for P(X — A < tX) follows similarly from 
P- □ 



3 Proof outlines 

We give a sketch of the proofs of the main results in the paper. This section 
is only included as an outline in order to give the reader a general feeling for 
the ideas used and to facilitate understanding of the detailed proofs in the 
later sections. 

Equal volume of the basins of attraction. Of all the results mentioned 
above, one of the most interesting and surprising is that a.s. all the basins 
of attraction have volume 1. This claim is relatively easy to prove, given the 
fact that the basins of attraction are a.s. bounded, and if we also assume that 
they have piecewise smooth boundaries (in Section H] we give a detailed proof 
of the equal volumes property which does not use any information on the 
smoothness of the boundaries). Here is the proof, which is an adaptation of an 
argument due to Boris Tsirelson [15] . First, we need an alternative expression 
for the force F(x) that does not involve a different order of summation at 
every point x. In Section [5] we prove the following formula. 

Proposition 5. Almost surely, for any x E~R d \Z we have 

oo 

F ( x ) = Y] ] n + K d x, (6) 

zez, |z|t 1 1 

where = n d / 2 /T(d/2 + 1) is the volume of the unit ball in W 1 and the 
summation is in order of increasing distance from 0. 
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Now, for a given basin of attraction B(zo), consider the oriented surface 
integral 



where n is the outward-pointing normal vector. We evaluate this integral 
in two ways. First, it is equal to 0, since by the definition of the basin of 
attraction, on dB(zo) we actually have F(x) ■ n = 0; this is because if F(x) 
had a component in the direction of n, there would be a flow curve crossing 
from one side of dB(zo) to the other. Second, the integral may be evaluated 
using the divergence theorem. Note that the function g : R d — > ~R d defined 
by g(x) = x/\x\ d satisfies div(g) = dK d 5 in the distribution sense, where 8 y 
denotes a Dirac delta function at y. Therefore, using §6§ we have 



in the distribution sense, and therefore, since B(zq) contains only the star 
z , we have 



whence Vol(B(z )) = 1. 

The proof of Theorem 3. Our proof of Theorem 3 was inspired by, and 
follows the rough outline of, the proof of the main result of [14, version 
1], though several new conceptual and technical features are added. The 
basic idea is as follows. The event Er is defined in terms of the continuous- 
space force field F(x) and is therefore hard to control. We bound it in 
terms of discrete events, by dividing space into a grid of cubes of side length 
r « (log-R) 2//d . Introduce a gravitational potential energy function U(x) 
whose differences U(x) — U(y) are the line integral of the gravitational force. 
Let B = R 8 ^ 9 . If there is a gravitational flow curve T crossing between 
dQ(0,R) and dQ(0,2R), then either U(x) > B for some x G Q(0,2R) (an 
event which can be shown to be of negligible probability), or if not, then 
"many" (a positive fraction) of the r-grid cubes intersected by the curve 
T have the property that either U(x) < —B for x in that part of T that 
intersects the cube or the change in potential energy along that part of the 









JdB(z ) JB{zo) 



[ F(x)-ndS= [ div(F)dx = dn d (-l + Vol(£(z ))), 
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curve that intersects the cube is smaller than a constant times Br/R. Call 
such an r-cube "bad" . 

Now, if we could prove that the probability for each cube to be bad is 
bounded from above by R~ s for some small 6 > 0, and that the events of 
different cubes being bad are approximately independent on an appropriate 
scale, then Theorem [3J would follow using standard subcritical percolation 
techniques. To bound the probability of a cube to be bad, we divide each 
cube into a grid of smaller cubes of size p = i?^ 1 / 10 , and show that if the r- 
cube is bad, it contains many "black" p-subcubes, where, roughly, a subcube 
is called black if it contains a point x where the norm of the force F(x) is 
smaller than a constant times B/R. The probability of a cube to have many 
black subcubes is bounded using a first moment bound, which in this scale 
seems like the best one can do because of the extreme dependence of these 
events (since p « 1). 

As for the approximate independence of the events of different cubes being 
bad, this is not strictly true in the scales under consideration. It is shown 
that the independence requirement can be replaced by a theorem bounding 
the joint density of the force field F(x) evaluated at some set of points. 

To make this skeleton of a proof work, several novel features are required. 
Detailed large deviations estimates are obtained for the gravitational poten- 
tial, the force and its derivative. The joint distribution of the vector of values 
of forces at a given set of points is analyzed in detail. In the analysis of the 
bad cubes, it is necessary to bound the contributions to the potential energy 
from two asymptotic regimes: First, from the effects of nearby stars causing 
the potential function to be close to — oo; this is dealt with using a separate 
percolation argument. Second, from the "intermediate" range consisting of 
the scales between r and R l / d \ these contributions are dealt with by diluting 
the set of potentially bad cubes by at most a factor 1/2 and using a geometric 
covering lemma. Third, from the far range of stars at distance > R 1 ^; this 
is dealt with using the large-deviation estimates. In Dimensions 3 and 4, a 
more delicate argument is required involving a potential energy function that 
is not translation-invariant and has worse large-deviation behavior than in 
high dimensions. 



4 Derivation of Theorem [2 



We now show how Theorem [2] follows from Theorem [3J and Proposition 
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First, the fact that F(Er) — > as R — > oo clearly implies that a.s. all the 
basins of attraction are bounded. 

Next, let zq = ipz{fy be the star whose basin of attraction contains 0, and 
let Bq = -^^(-^(O)) be the basin of attraction of z . Let X be the allocation 
diameter X = diam(i?o). If X > R then there exists an x £ Bo with |x| > y, 
and therefore ||^||oo > it/d^' Now, ^ Ikolloo > it/^ then since is in the 
basin of attraction of zq the event E R ^ Sy q happened. Otherwise, since x is in 

the basin of attraction of z and we have ||^o||oo < ~^jqR < — H 3 '!! 00 ' 

the event E R ^ 4y y^ happened. So we have shown that 

nX>R)<¥(E R/8V1 )+F(E R/iV1 ). 

This implies that the estimates (T5]) and ([3]) follow from the corresponding 
estimates in Theorem [31 

Next, we show that a.s. all basins of attraction have volume 1. We use 
a variant of the argument sketched in Section [3] which does not require any 
knowledge about the smoothness of the boundary of B{. A similar argument 
in a slightly different context was briefly mentioned in [HI Section 12.2]. 

Let Bi be the basin of attraction of the star Zj. As in the introduction, for 
x G Bi denote by r x the time that it takes x to flow into Zi, or equivalently the 
maximum time up to which the integral curve Y x (-) is defined; for continuity 
we set r Zi = 0. For < a < b < 00 denote 

E a ,b = {x e Bi : a <t x <b], 
V a>b = Yo\{E a>b ). 

Note that on Bi \ {z{\ the force field F satisfies div(F) = —dn^, by taking 
the divergence of each term in ([6]) (see Lemma [7] below, which justifies inter- 
changing the divergence and summation operations). Therefore, by a version 
of Liouville's theorem [21 Lemma 1, p. 69], it follows that ^VJ )00 = —dK d V ttOQ , 
so 

V t!O0 = Vo^e-*"*. 
In particular, for t \ we get that 

V ,t = V ,oo - l4,oc = ^0,00 (1 - e"*^) = d Kd V 0>oo t + 0(t 2 ). (7) 

Estimate Vo,t for small t in a different way, as follows. In a neighborhood 
of Zi the field F satisfies F(x) = , Zi Z X \d + 0(1). Without the error term it 
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would be easy to solve the differential equation explicitly, so this implies by 
approximation that 



B(z h {dtf/ d - o(t l / d )) C Eo ti C B( Zi , {dtf/ d + o(*Vd)) 



(this relies on the following easily- verified claim regarding a one-dimensional 
differential inequality: if g is a real-valued function on [0, oo) that satisfies 
0(0) = 0, \g'(t)-g(ty- d \ < C, then g(t) = (dt) l ' d + o(t^ d ) when t \ 0), and 
therefore that 

V 0)t = dK d t + o(t). (8) 

Equating and (JHD gives that Vol(-Bj) = V 0tOO = 1, as claimed. 

We have shown that the basins of attraction are a.s. all bounded and have 
volume 1, and they are clearly disjoint. The last claim that needs proving is 
that a.s. they cover all of M. d except a set of measure 0. We use the following 
mass transport lemma. 

Lemma 6. Let f : Z d x Z d — > [0, oo) satisfy f(m, n) = f(m + u,n + u) for 
any m,n,u G Z d . Then for all n G Z d we have that 



m£Z d 

f(m — n, 0) 
Define / : Z d x Z d -> R by 



m, n 



Proof. f(m,n 



g{m — n), and both sums become just 

□ 



f(m,n) 



E 



Vol (Q(m, 1/2) n |J 5, 



i-.ZieQ(n,l/2) 



or in words the expected volume of the part of Q(m, 1/2) that gets allocated 
to some Zi G Q(n, 1/2). Note that ^2 me z d fi 171 ^ n ) represents the expected 
volume of points in M. d being allocated to some Zi G Q(n, 1/2). Since we 
showed that Vol(-Bj) = 1 for all i, this is equal to the expected number of 
^ G Q(n, 1/2), which is Vol(Q(n, 1/2)) = 1. So, if we denote D = U^Bi, 
the union of all the basins of attraction, then by Lemma [6] we get that for 
all n G Z rf we have 



1 



E 



Vol(D n Q(n, 1/2)) 
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The random variable Vo\(D D Q(n, 1/2)) is bounded from above by 1. If its 
expected value is 1 then it is 1 almost surely. Therefore almost surely 

Vo\{R d \D) = J2 Vol(Q(n, 1/2) \D) = ^0 = 0, 
as claimed. □ 



5 Existence and differentiability of F 

In this section we prove Propositions [1] and [5j 

5.1 Proof of a.s. convergence of F(x) 

First, let us prove that F(x) is well-defined, that is, that the sum in (jTJ 
converges a.s. for fixed x G M d . Since the sum is defined in a translation- 
invariant manner, it is clearly enough to prove that the sum for F(0) con- 
verges a.s. Let p — 0, and for i > 1 let = |^| be the distance of Z{ 
from the origin. Since Z = (ziji is a Poisson process, we then have that the 
random variables [Kd{Pi ~ pf-i)) =1 are i-i-d. with Exp(l) distribution (recall 
= Vol(-B(0, 1))), and therefore by the law of large numbers, almost surely 



Pi 
ji/d 



3-1' 



l/d 



> «J (LLN) 



Now, if we condition on the values of thinking of them as a de- 

terministic sequence such that Pi/i 1 ^ — > ^~^ d as i — ► oo, then each ^ is 
distributed uniformly on the sphere of radius pi around the origin. For any 
i > 1, each term (;?j)/|zj| d in the sum in (where x is taken as 0) has 
(conditional) mean and variance bounded by 0( K p i 2( ~ d ^) = O (i~ 2 ( d ~ i y dS \ . 
Since in the event (LLN) the sum of the variances converges (note that this 
fails in dimension 2), by the Kolmogorov three-series theorem the sum in 01) 
converges a.s. This is true a.s. conditionally on (pi)i, therefore it is true a.s. 
and F(0) is defined. 
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5.2 Simultaneous convergence with a fixed order of 
summands 

Denote as before pi = \zi\. Let g : M. d — > M. d be defined by g(x) = x/\x\ d . Let 
z'o = min{i : p« > 2}. Define 

oo oo 

*(*) = £jf=^ = !>(*-*). ( ) 

i=i i=io 

a function that we will see shortly is closely related to F(x). 

Lemma 7. Almost surely, the sum defining H(x) converges simultaneously 
and uniformly for all x G 5(0, 1) and defines a continuously differentiable 
function. The series can be differentiated termwise. 

Proof. For a function / : W — > W , denote by Dkf the tensor of k-th deriva- 
tives of / (which can be thought of as a (J ■ i fc )-dimensional vector). Note 
that for |x| > 1 we have that 

\D l9 {x)\ = 0(\x\~ d ), (10) 
\D 2 g(x)\ = OCH-*- 1 ), (11) 

and in general for any k > we have 

\D k g(x)\=0(\x\- d - k+1 ), (12) 

where the constant implicit in the big-0 depends on d and on k. The best 
way to see (fTUl) is to represent Dig(x), the matrix of the first differential of 
g at x, in an orthonormal coordinate system containing the radial direction 
this gives a diagonal matrix whose entries are d — 1 copies of \x\~ d and 
one copy of —(d — l)\x\~ d = -^\ r _^r~( d ~ l \ so in fact \Dig(x)\ = C 1 \x\~~ d , 

1 /2 

where G\ = (d(d — 1)) . Equations (ITT]) and (fT2"|) can be proved similarly. 
Now, similarly to define for x G B(0, 1) 

oo 

H x {x) = Y^D^Zi-x)], (13) 

i=i 
oo 

H 2 (x) = J2 D ^9(zi-x)]. (14) 
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Condition on the pi, and condition on the event (LLN). In the previous 
subsection we showed that the sum defining H(0) converges a.s., so assume 
that this holds. A similar argument shows that the sum defining Hi(0) 
converges a.s., so condition on that as well. We shall show that under these 
conditions the sum in (Q converges uniformly on -8(0, 1) to a C 1 function. 

First, f[TTj) together with the assumption that (LLN) holds immediately 
imply that the sum (|14p converges absolutely uniformly on -5(0, 1), and sim- 
ilarly from (fl2"l) the same is true for the sums of differentials of all orders 
k > 2. In particular it follows that (under the above conditioning) H2(x) is 
a C°° function on 5(0,1). 

Next, Hi(x) — Hi(0) can be represented as a sum of line integrals from 
to x of the terms in the sum for H 2 {x). Therefore the sum for Hi(x) — Hi(0) 
converges uniformly on -8(0, 1) to a function whose differential is H2(x), and 
since we assumed that the sum for Hi(0) converges, it follows that the sum 
f fT3|) converges uniformly on -8(0, 1) to a differentiable function. Similarly 
H(x) — H(0) can be represented as a sum of line integrals of the terms in 
f fT3j) . so repeating the above argument, using the fact that we assumed that 
the sum for H(0) converges, gives that the sum in ([§]) converges uniformly on 
-8(0, 1) to a C 1 (in fact, C°°) function. This was true under the conditioning 
on an almost sure event, so the lemma is proved. □ 

5.3 The rearrangement identity 

If u, x £ M. d we denote 



(the terms are summed in order of increasing distance from u, and this sum 
is defined with F(x)). 

Lemma 8. For any x,u,v G M. d we have that a.s. 



Proof. First, compute expectations: Let N U}X be the (random) number of 
stars in the ball B(u, \u — x\). Recall the well-known physics principle that 
says that the total gravitational pull on a point x from a uniformly distributed 
spherical shell of mass with center u, radius r and total mass M is equal to 




G {u \x)-G {v} {x) = K d (u-v). 



(15) 
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if r > \u — x\ and to M(u — x)/\u — x\ d if r < \u — x\ (this last fact follows 
from the harmonicity of the function x — » (u — x)/\u — x\ ). Therefore, by 
conditioning on the distances of the stars from u (as was done in Section 15.11 
above with u = 0), we get that 



E 



x 



N, 



u — x 



u,x I ir 

\u — xr 



Therefore 



E 



G {u} { 



u — X 
\u — xr 



kAu — x\ 



u — x 
\u — x\ c 



K d [U - X), 



SO 



E 



x 



E 



G {v h 



x 



K d (u - V). 



Now, let R > be large, and consider the truncated series 

Zi X 



Then 



G { n\x) 



G [ n\x) 



GP(x) 



E • 

\zi— u\<R 



\Zi X\ 



E 



Zi X 



I iy . rp I d ' 



Zi&A R Zi^B R 

where A R = B(u,R) \ B(v,R),B R = B(v,R) \ B(u,R). We show that the 
variance of this expression tends to when R — > oo: Partition the set 
B(u, R)AB(v, R) into 0(R d ~ 1 ) disjoint sets (Ej)j of Lebesgue measure 0(1) 
such that each Ej is contained in either A R or B R (see Figure [5J the constant 
in the big-0 depends on u and v), and for each j let 



Y; 



E 



Za X 



be the contribution to the force from stars in Ej. Then we can write 



G { n\x)-G 



x 



The K's are independent, and each has variance bounded from above by 



m\ 2 



E 



< 



E[|yj| 2 |card(Ej n {z, t } 
C 



R 2d- 



:E 



card(£^j fl {zi}i 



O 



R 2d- 
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Figure 5: The balls B(u,R) and B(v,R) and the sets Ej. 

(Note that this is true since B(u, R)AB(v, R) C R d \B(x, R/2) for sufficiently 
large R, see Figure E]). Therefore 

Var (GP(x) - G% } {zj) = 0(R~ d+1 ) — > 0, 

which finishes the proof, since a.s. G^(x) — > G^(x) and (x) — > G^(x) 
as R — > oo □ 



5.4 Proof of Propositions CD and [5] 

Both Propositions [1] and [5] follow immediately from the following theorem. 

Theorem 9. With probability 1, the following four statements hold: 

(i) The sum defining G^(x) converges simultaneously for all u E M. d and 
x ER d \Z. 

(ii) The convergence is uniform on compact sets in M. d x (M. d \ Z). 
(Hi) The rearrangement identity ( Fl5l) holds for all u,v and x. 

(iv) For all u the function G^(x) is continuously differentiate in x. 

Proof. By Lemma [7] we know that, off of a null event Q\, for all rational 
u E M. d the sum defining G^(x) converges simultaneously for all x EM. d \Z, 
and the convergence is uniform for x ranging in a compact set. 
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By Lemma [SI off of a null event Q 2 , the identity (IT51) holds provided u,v,x 
are rational. (By the continuity in x, the assumption that x is rational can 
be dropped outside fli U Q2-) 

Let N(u, R, e) be the number of stars in the e-neighborhood of the sphere 
of radius R around u. The mean of N(u,R,e) is at most CdR d ~ 1 e. Let 
A(u,R, e) be the event that N(u,R,e) is less than twice its mean. Then by 
Lemma HI 



F{A{u,R,e) c ) < expi-aR^s) 



(16) 



for some a > 0. It follows by Borel-Cantelli that off of a null event Q3, 
for each rational q and e > 0, there is a (random) i?* = R*(q, e) such that 
A(q, R, e) holds for all R > R*(q, e) that are multiples of e. 

Now fix a configuration of stars uj ^ U ^ U 3 , and choose e > 0. 
For each u G M. d find a rational q = q(u,e) within distance e of u. Then for 
R > R*(q,e) and x G B(u,R/3), we have 



-4 9} (*: 



< 2iV(u, J R,e)(2/ J R) d - 1 < 2 d+1 C d e 



where G^(x) is defined as in the proof of Lemma [8] above. Thus 



lim sup 

R 



G%\x)-G^(x) 



< 2 d+1 C d e 



for all e > 0. This verifies (i) for u, and (ii), (iii), (iv) follow similarly by 
approximation. □ 



6 The joint density of a vector of forces 

In this section we prove an estimate that will be required in the proof of 
Theorem [3j Suppose we have iV points X\, . . . , xn G M. d with \xi — Xj\ > S 
for every i 7^ j. Fix a positive A, and define the event 

E = jThere is at least one star in B(xi, A) for every 1 < i < n\. 

Denote by A4 the cr-algebra generated by the locations of the stars in 
(U^Bfai, A)) c . Denote by X the random vector of forces (F(xi))i<i<N. 
Then we have the following bound on the joint density of X. 
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Theorem 10. There exist constants c , C\ > (depending on the dimension 
d) such that if 

A < co-—^-X> ( 17 ) 
(logiV)< 



then conditioned on the event E and on the a -algebra M., almost surely the 
joint density of X exists and is bounded from above by (Ci\ d ~ d ) N . 

We will use the following two simple lemmas. 

Lemma 11. There exists a constant C-j > (depending on d) such that if 
Xi,X2, . . . , ijv 6 R d satisfy \xi — Xj\ > S for all i ^ j , then for all 1 < i < N 
we have that 

j2 1 < CVlogiV 



i=2 



Xj-xi\ d S d 



Proof. For k = 1,2,..., let V k = {xi}f =2 D ^(^,2^) \ 2 fe " 1 1 S)) . 

Clearly \V k \ < ^/gff^f )} < 4 d • 2 M , and also trivially |T4| < iV. There- 
fore we have 



N 



Xj-a:i| d ~ ^ ^ |x - a;i| d ~ ^ 2 kd S d /2 d 



Ldlos2J 



ti lug ^ - ^ 



5<d ^ 2 fcd S ,<i 

log jV I 
d log 2 J 



fc=l fc_ I log -?V 



= {W\ +0 (i\ =0 (W 



S d \S d \ S d 



□ 



Lemma 12. If A = {aij)^j =1 is a matrix such that \a^i\ > 2J2j^i\ a i,j\ f or 
all 1 < i < k, then 

1 k 

I det A| > git U loi.il ■ 

i=l 

Proof. This is a variant of Hadamard's theorem in linear algebra. First, by 
multiplying each row of A by a~l, we may assume without loss of generality 
that a iti = 1 for all 1 < i < k, so A = I + B, where / is the identity matrix 
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and B = (bij) is a matrix such that . \bij\ < 1/2 for all 1 < % < k. For all 
x G R fc we have 



Therefore 



l-Bxjloo < Hxllooinax^ l&^l < -\\x\ 



-AX OO J.X ~\~ Bx OO — CO 5lC OO ^ r* ^ CO * 



2' 

We have shown that all the eigenvalues of A are greater in absolute value 
than 1/2, therefore | detv4| > 2~ k , as claimed. □ 

Proof of Theorem [73 Let us condition everything on the event E and more- 
over on the number of stars z/j in B(xi,X) for each 1 < i < N. With this 
conditioning, the set of stars in B(xi, A) is simply a vector (Y^i, Y i<2 , ■ ■ ■ , Y ijVi ) 
of Vi i.i.d. points chosen uniformly in B(xi,X). Now condition further on 
the a-algebra M. and on the a-algebra C generated by the locations of the 
stars {Yij \ I < i < N,2 < j < z/j}. This leaves only the stars (l^,i)i<i<:jv as 
a source of randomness. The vector X of forces can therefore be written as 

X = G(Yi i, Y 2: \, . . . , Y N> i) + Z, 

where Z represents the contribution that is measurable with respect to AiVC, 
and where G : B(xi, A) x B(x 2 , A) x . . . x B(x N , A) — > R Nd is the function 
defined by 

(N N N 
\- Vj -xi yj -x 2 \ - yj - x N 
id' z^i id' ••• ' z^i \d 
w - xi\ d pi \yj - x 2 \ d p x \ Vj - x N \ d 

Denote B = B(xi,X) x B(x 2 , A) x ... x B(x N ,X). The volume of B is 
K N ^Nd^ Therefore, to prove that the joint density of G(Yx > i, Y 2 i, . . . ,Yj 



N,l, 



and therefore also the joint density of X conditioned on the event E and on 
the cr-algebra M. V£, is bounded from above by (CiA^ 2- ^)^, it will be enough 
to prove two things: First, that the function G : B — > M. Nd is one-to-one; 
and second, that the Jacobian of the function G : B — > M> Nd is bounded from 
below by (C^A^ 2 ) - ^, where C 2 > is some large constant. Interestingly, 
both of these claims require that the assumption (fTTj) hold for some constant 
c > 0. 
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We prove the first claim. Assume fllTj) . where Co > is some small 
constant whose value will be specified soon. Denote as before g{x) = x/\x\ d . 
Fix ai, . . . , ajv £ Our goal is to prove that if the system of equations 

TV 

^2g(y j -x i ) = a i , l<t<N (18) 

has a solution . . . ,y N ) G B, then this solution is unique. The following 
proof of this fact was suggested by the referee, and simplified an earlier proof. 
Assume the contrary: y = (yi, . . . , y^) G B, y' — (y[, . . . , y' N ) G B, and 

N N 

^2g(yj - Xi) = ^2g(y'j - x { ) for % = i,...,n, (19) 

3=1 3=1 

and the number e = maxj \yj — y'j \ does not vanish. Without loss of generality 
assume that \y± — y[\ = e. Introducing u = g(y± — xi),u' = g(y[ — Xi),v = 
Y!j=2 9{Vj ~ x i), v ' = Ylj=29iy'j - xi), we have u + v = u' + v' (by $J$) for 
2 = 1; other i will not be used), therefore 

| it — u'\ = \v — v'\. (20) 

We note that \yj — xi\ > \xj —xi\ — \yj —Xj\ > \xj — x\ \ — A for j > 1. Taking 
into account that \xj — Xi\ > S > 2A by (TT7|) . we get \yj — X\\ > ~\Xj — X\\. 
By (HOD, fori > 1, " 

\g(yj - xi) - g{y'j - xi)\ < c 



\Vj - 


-y'j 




min(|^ - x x \ 


d 1 

5 | 


y'j - xi\ 





\Vj - 




\Xj - 


- X!\ d 



d • 



By Lemma [TT| 

N N 



\V — V 



'| < J2 \9iVi ~ *0 ~ 9(y- - *i)| <CsJ2 r-^TR < 



C 99 e\ogN 



\xj — x\\ d S d 

j=2 j=2 1 J 11 



Here C99 does not depend on Co in (fTTl) as long as ^g^i/d < 0.5. 

Using PJJ and (HZJ), we get that \u - u'\ = \ v - v'\ < C ^ N < 
Cg 9 e(c /X) d , that is, 

\9(vi-xi)-9(yj-xi)\< c (cj>y (21) 

m-y'il 

27 



The function g : R d \ {0} -> R d \ {0} is invertible; g' 1 ^) = z/\z\&. Simi- 
larly to ([TDD, = 0(|z|~a=i). Thus if we restrict g to 5(0, A) and 
accordingly g^ 1 to M. d \ B(0, ts-l), then g~ l satisfies the Lipschitz condition 
with the constant CggA^ (even though M. d \ -8(0, t3=t) is not convex...). We 
consider z = g(yi — Xi), z' = g(y[ — xx) and get by (1211) and the Lipschitz 
condition 

\*~A ^ n fCo\ d \g~\z) - g-\z'V 



< ^99 T J i 71 — C98^ C 



\ g -\ z ) - g -^z')\ - ~uy ' \z-z 

which is a contradiction if Co is small enough, namely if CggCggCp < 1. This 
finishes the proof that G is one-to-one. 

It remains to prove that the Jacobian of G is bounded from below by 
(C 2 A d2 ) _7V , for some large constant C 2 > 0, again assuming ffTTj) . The Ja- 
cobian matrix J of G can be written as a block matrix {Ji,j)i<i,j<N, where 
each J it j is the dx d Jacobian matrix of the function jjj — * \y J Zx\d ■ Again by 
the computation of the matrix Dig(x), each Jjj is a diagonalizable matrix 
with one eigenvalue equal to — (d — l)\yj — Xi\~ d and d — 1 eigenvalues equal 
to — Furthermore, by choosing for each y^ the appropriate radial 

coordinate system (as a function of y^), we may assume that the blocks J^, 
1 < % < N are in diagonal form. Any other block J^- = (a»j,*!,j)i<fc,Kd f° r 
z 7^ j is not necessarily in diagonal form, but its entries satisfy 

di,j,k,i < CslVj ~ Xi\~ d < Ce\xj - Xi\~ d . 

Recall that our assumptions are that \y%— Xi\ < A and \yj— Xi\ > S — A > S/2 
for i ^ j. We wish to apply Lemma [T2l to the matrix J. By Lemma [TTJ the 
assumptions of Lemma [12] will hold if we have 

A-< > 2dC aC ^. 

This holds if A < C 8 nTj^fwrf > where C 8 = (2c/C 6 C 7 ) _1 / d . The conclusion of 
Lemma [12] is exactly our claim. □ 



7 The gravitational potential function 

We define a new function, the gravitational potential function. It will be 

defined in dimensions 5 and higher only, and is designed to be a stationary 
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centered random function that has as minus its gradient the force function 
F. If d > 5, the gravitational potential function U : M. d — > K is defined by 



U(x) 



— lim 

d — 2 t^oq 



E 

< : |a<— !b|<T 



-1 



d-2 



+ 



dn d 



rj~i2 



(22) 



As with the case of the force, we need to check that the potential function 
is a.s. defined and is well-behaved (in fact, in the case of the potential this 
is only true in dimensions 5 and higher). For any p > q > 0, denote by N Piq 
the random number of stars in B(0,p) \ B(0, q), and denote 



p,g 



E ■ 

i : q<\zi\<p 



I d-2 ' 



Let W Pt q be a random vector distributed uniformly on B(0,p) \ B(0, q). An 
easy computation gives the following. 



d p a 



-.d+a 



d + a p d — q d 



E[N Piq ] = \ar[:\;,,] =/,•,(//'' - ,/') 



(a ^ -d), 



N P , q M [\w m r d ]-N Pjq . d -.^4- d ., 



2 p a 



E[U m ] 
V&i[U Piq | N p>q ] 



Var[C/p iQ 



dUd , 2 2\ 

~2-(P -Q), 
N p , q -Vax[\W p , q \ 2 - d ] 



(23) 



d q 4 ~ d - - 4 " c 



d-4 



pd _ gd 



4 



9 9 

p- — q- 
pd _ qd 



E 



Var[Z7 P) g | N P) , 



+ Var 



E[U p>q | A/j 



P>9J 



dn d 

d-A \q 



-.d-4 



p" 



(24) 



Now, from (123]) and ([21]) it immediately follows that when d > 5, for any fixed 
a? e R d the limit in (122]) exists a.s. and defines a centered random variable. 
For any m,i6K <! denote 



1 



d-2 



E 

i : \zj— u\<R 



-1 



Zi X 



d-2 
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An easy computation (similar to the one in the proof of Lemma [H]) gives 
that ~E(H^(x)) = ^f\u — x\ 2 if \u — x\ < R. We have shown above that 
£7(0) = liniR^oo H^(0) converges a.s. Next, by Theorem[9]it follows that for 
any i£K l! the limit lim^oo H^(x) exists a.s., uniformly for x in compact 
sets, since the difference H { ^ } (x) - H { ° } (0) can be represented as minus the 
line integral of G^(-) (defined in Section [573]) from to x. By translation, 
it follows that the limit 

H {u \x) := lim H { n\x) 

converges a.s. for any fixed u G M. d uniformly as x ranges over compact sets. 
As before, H^{x) satisfies a rearrangement identity similar to ([13]) : 

Lemma 13. For any x, u, v G ~R d we have that a.s. 

HM(X) - H^(X) = y (\U ~ X\ 2 ~\V- X\ 2 ) . 

We omit the proof, which is similar to the proof of Lemma [BJ and is also 
superseded by the following stronger lemma. 

Lemma 14. Almost surely, we have that 

-> 0. 



max 

ueB(o,i) 



/# } (o)-£|«| a -f#>(o) 



2 

Proof. For any m G N and e > 0, consider the event 

JL = { max max M u} (0) - — \u\ 2 - M°>(0) 

m I m<R<m+l UEB(0,1) H 2 



> £ 



We shall show that for any e > we have Y^m=i ^(^m) < 00 • By Borel- 
Cantelli, that implies the claim of the lemma. 

To that end, fix a large m G N. Let E m be a -^-net of numbers in [m, m + 
1], and let N m be a -^~net in B(0, 1), choosing nets such that \E m x jV m | = 
0(m 2d+2 ) and such that for all v G iV m and r G £7 m we have (r — m) > m~ 2 /2 
and |f I > m~ 2 /2. For t> G A" m and r G i? m denote A mj „ jr = B(0,m)AB(v,r) 
and ^ m ,i>,r = card(ZnA m>t , jr ) (the number of stars in A TO> „ jr ). For v G N m , r G 
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E m define events 



K 



Vol(A w ) > m ai (Vol(A m)Wjr )) 



£? ( v, r H r ) \ B (v, r ) contains 



> 20K d m d 3 stars 



5. 



e 

m,v,r 



^ w (0)-fl-l 2 -^ 0} (0)|>|}. 



Because the number of stars in a region has the Poisson distribution, by 
Lemma H] we get that for some constants C,c> 0, for all m we have 



P(#m,«,r) < Ce 



P(£r. 



(25) 



(note that our choice of the nets _E m and N m forces Vol(A m ?J>r .) to go to 
infinity when m grows large). Next, we derive a bound for P(S^ v r fl „ r ). 
Denote 

w w = ^ w (o)-|H 2 -^(o), 

and observe that W m ^ yr is a centered random variable that, conditioned on 
the event v m , v ,r = k, can be written as a constant e mj1Jjr . := -r^ (^-(r 2 — 
plus a sum of /c i.i.d. random variables with values in [— (m — l) 2 ~ d , (m 
l) 2 ~ d ]. Therefore we have that 



m 2 ) — 



B{y m ,v,r) 

k 

Vol(A miJJj , 

(since E(W mjU)7 .) = 0). Now, take such that 

| A; - Vol(A w )| < m 01 (VoKA^,,)) 172 . 
Noting that for some constant C2 > we have that 

Vol(A mj „ ir .) > C2m d ~ 1 max(r — m, \v\), 
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it follows that 



E(W m>v>r Wm,v,r = k) = mV^O ( ^ , + ) = O K°- 9 ) 

\ max(r — m, \v\) J 

(since d > 5). In particular, for such fc, for any e > we have for suffi- 
ciently large m that E(W nvUj7 .|k' m ,u,r = fc) < e/4, and it follows by Hoeffding's 
inequality applied to the representation of W m , v ,r conditioned on the event 
Wm,v,r = ^} described above that for some constants c 3 , c 4 > we have that 

It follows that 

P(^n^ r )<e- C ^ md " 3 , (26) 

which was our desired estimate. 

We now claim that for any fixed e > 0, for m sufficiently large we have 
that 

u 



Together with the above estimates (I25I) and (1261) . this will prove that 
^] m P(J^) < oo and therefore the claim of the lemma. To prove this, let 
u G 5(0,1) and R G [m,m + 1). Let v G N m and r G E m such that 
|u — u\ < m~ 2 , \R — r\ < m~ 2 . In particular, we have that the symmetric 
difference B(u, R)AB(v, r) satisfies 

B(u, R)AB(v, r)CB[v,r + — J \B[v,r- 2 



Then 



4" } (0)-yH 2 



#i° } (0) 



< 



# r M (0) 



+ 



K d i 



i?r(o)--i«i 



^ 0} (o) 



# r W (0) 



K d I |2 

2 |V| 



K m> v,r U ^m,v,r U •S'm.u.r ) did not occur, the 



Assuming the event LUiWeiSi 
first term in this bound is < e/2, and the second term is at most 



2 



\u\ 



20K d m d 3 d/o | ~ 



(m/2) 



d-2 



d 



21 C 

r < — . 

m 



This is also < e/2 if m is large enough, which means that did not occur. 

□ 
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Combining the above results as in Sectional we have proved: 

Proposition 15. If d > 5, the limit in (|22|) exists a.s. simultaneously for all 
x G M d \ {zi}i and defines a stationary centered process that is a.s. differen- 
tiable everywhere it is defined and satisfies 

i:\sa\<T 1 1 

VC/(x) = -F(x). (28) 

We will occasionally use truncated versions of the gravitational potential, 
the force and its first differential. For a bounded set A C M d , define U(x\A), 
the partial potential from stars in A, by 

Similarly, define F(:r|A), the partial force from stars in A by 

F(x|A)= Y * l ~ X wl - [ r^—^(N0\{z). 

A hZj - X d L 2 - X \ d 
i:z^A 1 4 1 JA 1 1 

For a set A C M d whose complement is bounded, define 

U(x\A) = U(x) -U(x\A c ), 
F(x\A) = F(x) -F(x\A c ). 

Note that U(x\A) and F(x\A) are centered to have mean 0. 

While these definitions are rather general, throughout the paper we only 
use sets A which are annuli of the form A = B(y,p) \ B(y,q), where < 
q < p < oo (this includes the degenerate cases of a ball, the complement of 
a ball, and the entire space). Furthermore, in all the cases we will consider, 
we will have that either [q > and \x — y\ < q] or [q = and \x — y\ < p\. 
In those cases, from the computations in the proof above it is easy to verify 
that we have the following explicit expressions for U(x\A) and F(x\A): First, 
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if q > and \x — y\ < q, then 
U(x\B(y,p)\B(y,q)) 



E 



d — 2 14 — ' | Zi — x 

q<\zi-y\<p, NT 



d-2 



2(d-2 



(29) 



F{x\B{y,p)\B{y,q)) 



E 



9<|2i-J/|<P, kilt 



Second, if g = and |a; — j/| < p then 

1 



F(x\B(y,p)) 



E 



d-2 ^ k-x| d - 2 2(d-2) 



V 



E 



l^i-J/|<P> NT 



+ /^(x - y). 



We will also use the function DiF(x\A), the first differential of F(x\A). By 
Lemma[7]we have the following explicit expressions for DiF(x\A) in the cases 
described above: If q > and |x — i/| < q then 



D 1 F(x\B(y,p)\B(y,q)) = ^ D x 

9<ki-y|<p 5 NT 

and if q — and |x — y\ < p then 



x 



D 1 F(x\B(y,p))= D ' 

\zi-y\<p, NT 



Zi X 



\Zi x\ 



dxdy 



(30) 



where Idxd is the d x d identity operator. 

In all the above sums, if the region of summation is infinite then the terms 
are summed in order of increasing distance from 0. 



8 Large deviations estimates 

In this section we derive detailed large deviations estimates for the force 
F(x), its derivative DiF, and (in dimensions 5 and higher) the gravitational 
potential function. 
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8.1 Large deviations for the value at a point 

Consistently with the previously defined notation, let D^F[x | A) denote 
the fc-th differential tensor of the function x — > F(x | A). 

Theorem 16. There exist constants C%, C2, C3 > s«cn that for all p > q > 
arid t > we have 

(|[/(0 I 5(0,p)\5(0,g))| >ij < Cie" C2<?d " 2 ' log (^), (31) 

(|F(0 I B(0,p)\B(0,q))\>t) < C ie - C2 ^ lllog (f), (32) 

p(D 1 F(0|B(0 I p)\B(0,g)) > t) < de^' 108 ^. (33) 

Equation (I3T]) no/ds m dimensions d > 5, and equations ( 132]) and ( |33|) no/d 
/or a// dimensions d > 3. 

Proof. Assume d > 5. Denote 5 P)9 = 5(0, p) \ 5(0, g). Let W Ptq , N Ptq ,U Ptq 
be as in Section so 17 (0 | 5(0,' p) \ 5(0, g)) = 3=5 (17 M - Ep/pJ). Let 
V" = |W / p ,g|~ ( ' d ~ 2 ' ) - Then for any u > we have 



P(|y| > u ) = p 
< 



W \ d ~ 2 
vy p,q\ 



> u 



P I \W m \ < 



u l/(d-2) 



Vol(B p , q )u d /( d - 2 ) ' 

Therefore, noting that < V < l/q d ~ 2 , we have for any integer k > 2 that 



E(\V\ k ) 



l/q d 



ku k - l ¥{\V\ > u)du < 



k 



K d 



Vo\(B Pj 



q) JO 



l/q d 



ku 



k-2- 



d ~ 2 du 



< 



Vo\(B m ) k-l-^\q 
6K d 



d-2 



k-l- 



Vo\(B m )q( d - 2 ) k - d ' 
For any 9 > we have 



(34) 



E(e 



1 + 6E(V) + V °-EV k < 1 + flE(V) + V } 



fc=2 



fc=2 



< 1 + 0E(V) + 



Vol(5 p , 
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Conditionally on N PtQ , the stars in B pq are a vector of N Ptq i.i.d points dis- 
tributed uniformly in B pq . Hence by the last estimate we get that 



E 



JU v , q I at 



*l 1 + ffl W + vi5^ 



Now, it is a simple exercise that if X ~ Poi(A), then for any a, we have 
E((l + a) x ) = e Xa . Since N m ~ Poi(Vol(S Pt9 )), using this in the above 
inequality we get 

E (e eu ™) < exp (9Vo\(B M )E(V) + 6K d q d e e/qd ^ . (35) 

Since also E[U p>q ] = Vol(B p>q )E(V), we get 

E ( e *(^. 9 -B(^. 9 ))) < exp (QK d q d e 9 ^- 2 ' 

Therefore for any t > we have 

W(U M - E[U p J >ij< exp (-0t + 6K d q d e e/<}d ~ 2 

Set = g d - 2 log(t/(6/t d g 2 )) to get 

F(u m - E[U p>q ] >tj < exp ( - g^ 2 tlog(t/(6/^g 2 )) + g d " 2 t 

= exp (- 9 " 2tlOE (^?))- 

In the same way, one gets a similar bound for the negative tail F(U Pt(] — 
^(U p>q ) < —t), by noting that (I3"5"j) also holds for negative values of 6* if on 
the right-hand side e e l qd 2 is replaced by e^ 9 ^ qd 2 . Combining the negative 
and positive tail bounds gives fl3Tl) . The estimates fl32|) and fl33l) follow (with 
the weaker assumption d > 3) by estimating in exactly the same way the 
moments and exponential moments of (Wp^l - ^ -1 ^ and |Wp g | -d , respectively, 
in place of |Wp !g | _ ( d ~ 2 ). Note that the random variables U{0 | B(0,p) \ 
B(0,q)),F(0 | B(0,p)\B(0,q)),D 1 F(0 \ B(0,p)\B(0,q)) are all centered. 
We omit the full proofs. □ 
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8.2 Uniform bounds in a ball 

Theorem 17. There exist constants C\, c 2 , C3 > such that for all p > q > 

and t > we have 



PI max 



P( max 

aseB(0,lAf ) 



U(x I B(0,p)\B(0,q))\ >tj < C ie " C29d " 2tlog (^)(36) 
F(x I B(0,p)\B(0,q)) > t) < C 1 e- C29<i " ltlog ( £ t 1 )(37) 



PI max 



D 1 F(x I B(0,p)\B(0,q)) > t) < de^ 29 '* 10 ^^^ (38) 



where equation (1361) /ioWs in dimensions d > 5, and equations (1371) and ( 1381) 
/ioW /or a// dimensions d > 3. 

Proof. Start with (1381) . Set r = t/q. We have 



P( max 

> xeB(0,lA§) 

< p. 



D^F^x I B(0,p)\B(0,g)) 
£>iF(0 I S(0,p)\B(0,g)) 



> t 

> - 
~ 2 



-P ( max 

.xe-B(0,lA§) 



D 2 F(x I fl(0,p)\fl(0,g)) 



> r . 



(39) 



The first of these two terms is bounded by C\e c 2? *i°g(c3*) ^ ([33]) p or ^he 
second term, observe that by ffTTT) we have that 



PI max 



.xes(o,iA|) 
00 

< E p 



> r 



m=0 V ^B(0,1A| 



D 2 F(x I S(0,p)\B(0,g)) 

D 2 F(x I B(0,p)n(B(0,2 m+1 q)\B(0,2 m q))) 



max 



> 



2m+l 



00 

< £p 

m=0 



> 



2m(d+l) qd+1 — 2 m +l 



where z/ m is the number of stars in -8(0, p) fl (-8(0, 2 m+1 q) \ B(0, 2 m q)), which 
is a Poisson random variable with mean < C2 dm q d . Using LemmaHJit follows 
that the above sum is less than 



J2 C exp (-cr2 m V +1 log(cgr)) . 



m=0 
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Now, if in the above inequality crq d+1 log(cgr) > 2 then the whole sum is less 
than a constant times its first term, so 



P max 



. xes(o,iA§ / 



D 2 F(x | B{0,p) \ B{0, q)) > r < C exp (—erg + log(cgr)) 



On the other hand, if crq d+1 log(cgr) < 2 then the above inequality holds 
trivially if we take C slightly larger since then the right-hand side is larger 
than 1. Hence this inequality holds for all values of r and q. Plugging this 
into equation fl39|) together with the fact mentioned after fl39|) gives f|38l) . 
Next, to prove fl37|) . write 



P( max 

-xes(o,iA§ ) 

< P 



> t 
t 



F(x | B(0,p)\B(0,q)) 
F(0 | B(0,p)\B(p,q)) 

+P( max D x F(x I B(0,p)\B(0,q)) 

VxeB(o,iAf) 



> 



> r 



where again r = t/q. Both of the terms are bounded by C\e C2qd 1 ' 1 °s( c 3*/ | 3) 
by ([32D and (|38l) . 

Finally, to prove fl36|) . write similarly 



P( max 

> xeB(0,lAf ) 

< p / 



[/(x | B(0,p)\B(0,q)) 



> t 



U(0 | B(0,p)\B(0,q)) 



> 



+P ( max 

. xes(o,iAf ) 



F(x | B(0, p) \ B(0,q)) > r) < c ie - C2qd ~ 2tlo ^ Cit/q2) 



by (J3TD and (J3ZD - □ 

By letting p — > oo we get the following limiting case of Theorem [T71 

Corollary 18. There exist constants C\, 02,03 > snc/i t/iat /or all q > 
and t > we have 



P( max 

> xeB(0,lAf ) 



P( max 

. xeB(0,lA|) 



P( max 

. xes(o,iA§) 



D 1 F( 



x 



R d \B(0,q)) 


>-*) 


R d \B(0,q)) 


>-*) 


R d \B(0,q)) 


>-*) 



-c 2 q d 2 tlog(^- 



(40) 
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where equation (j40j) holds in dimensions d > 5, and equations (1411) and ( 1421) 
hold for all dimensions d > 3. 



8.3 Uniform bounds in a ball with a moving domain 



Theorem 19. There exist constants C\,C2,c^,C^ > such that for all p > 
q > and t > we have that if t > C^p 2 or p = oo then 



PI max 



.xeB(0,lAf) 

P( max 

X6B(0,1A§) 



C/(s | 5(x,p)\5(x,g))| > t) < C ie ~ C29 ^ 2 * log (^)(43) 

C ie - C29<i_ltlog (^)(44) 



F(x | B(x,p)\B(x,q)) 



>t) < 



PI max 



D l F(x \ B{x,p)\B{x,q)) >t) < de' ^ 10 ^, (45) 



where equation ( 1431) /io/ds m dimensions d > 5 ; and equations ( 1441) and (1451) 
/io/d /or a// dimensions d > 3. 

Proof. Denote 5 = 5 (0, 1 A |). First, we prove (T431) in the limiting case 
when p = oo. Fix x & B, then 



d \B(x,q)) -U(x | R d \5(0,g)) 
f/(x | 5(0, q)) - U(x | 5(a;,g)) 

' 1 - 9 ^ |r. _ a; I d-2 + d - 9 ^ I r, — ' 



«i6-Ei 



d-2 : 



where Ei = 5(0, q) \ B(x, q) and E 2 = B(x, q) \ 5(0, q). Now, denoting by 
v q the number of stars in B (0, g + 1 A |) — 5 (0, g — 1 A |), it follows that 



U(x \R d \B(x,q)) < U(x | P d \5(0,g)) 



+ C 5 g 2 + 



(d-2)(g/2) 



d-2 ' 



Since z/ g is a Poisson random variable with mean < Cq d , by Lemma H] we 
obtain that for t > 3Csg 2 we have 



P^max U(x | M d \5(x,g)) > t J < P ^max [/( 



+P 



x 



(d-2)(g/2)^ 



d \5(0,g)) >t/3 

> V 3 



< Cexp I -ctq d ~ 2 \o. 



(46) 
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This also holds trivially for t < "SC^q 2 (since the RHS is larger than 1) 
provided c is chosen small enough, hence it gives (I43p in the case p = oo. 

Finally, to prove (jl3"]) in the general case, note, using (JIoT) twice and using 
the assumption t > C^p 2 , that 



max 



> t 



< P 
+P 



max 

xeB(0,lA|) 



max 



> 



U(x | B(x,p)\B(x,q)) 

U(x | R d \B(x,q)) 

U(x | R d \B(x,p)) 

< 2de 

The proofs of (liljl and ( ]4"5j) are similar and are omitted. 



t 

> - 
~ 2 



□ 



9 Proof of Theorem [3] in dimensions 5 and 
higher 

In this section, we assume that d > 5. Our goal is to bound the probability 
of the event Er of having a gravitational flow curve connect dQ(0, R) with 
dQ(0, 2R), as R — > oo. The case of dimensions 3 and 4 is slightly more deli- 
cate. In Section [10] we explain what modifications to the proof are required 
to complete the proof of Theorem [3] in that case. 

9.1 Reduction to a problem on a discrete set of points 



Fix the following parameters: 
B = R*'\ 

a large constant (depending on d) whose value will be specified later, 
A-(lo gj R) 2 / rf , 

l 

J£ 10(d^ + l) 



A 

r 
P 
s 
e 
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We emphasize that R is the only true parameter here, and the values of all 
the other quantities are specified as functions of R. 

To control the event Er, we discretize space. Introduce a grid of points 
in the region Q(0, 2R) \ Q(0, R), defined by 

S = rZ d f](Q(0,2R)\Q(0,R)). 

We think of S as an induced subgraph of rZ d with the usual lattice structure. 
Thus, two points w,w' G S are called adjacent if \w — w'\ = r. A set W C S 
is called connected if the induced subgraph of W in S is connected. Call 
a set W C S connectible if W is contained in a set W C S which is 
connected and \W'\ < 10 \W\. To each point w G S associate an inner box 
Qm( w ) — Q( w , r ) an d an outer box Q ut{w) = Q(w,2r). 

Lemma 20. There exists a constant C\$ > such that for any L > 1 the 
number of connectible sets W C S of cardinality L is at most R d C\^. 

Proof. This is an immediate consequence of [51 Eq. (4.24), p. 81] □ 

Definition 21. Say that w G S is bad if there exists a gravitational flow 
curve 7 connecting dQ in {w) with dQ out {w) such that at least one of the fol- 
lowing conditions hold: 

(1) U(x | B(x,3R 1/d )) < -f for all x G 7, or 

(2) f 7 \F(x)\-\dx\<^. 

We wish to show that the "bad" event Er, whose probability we are 
trying to bound, implies the occurrence of many bad grid points. This will 
be true except on some atypical events which will happen with probability 
small enough as to be of no consequence. Define 



fii = I max U(x) > B>, 

I x£Q(0,2R) J 

U(x I R d \ B(x, 3R 1/d )) I > ^} 



x&Q(0,2R) 

Q 2 — \ max 

I zeQ(0,2R) 

fi.s = <i max 

x&Q(0,3R) 



D 1 F(x I R d \B(x,3R 1/d )) > 



Sy/dp- 

Lemma 22. For some constants C, c > we have for all R sufficiently large 
that 

P(fii) < Ce- cRi/ \ (47) 
F(n 2 ) < Ce- cRll/ \ (48) 
F(Q 3 ) < Ce- cRl+1/W0d2 . (49) 
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Proof. First, we prove @BD- Cover Q{0,2R) with 0(R d ) balls {B^f*? of 
radius 1 . For each ball Bj we have by Theorem [TH that 



P( max 

x<=Bj 



U(x | R d \B(x,3R 1/d )) iJ 



Therefore by a union bound we get that for some new constant C > 0, 
P(fi 2 ) < C" R d e - cR<d ~ 2>/dBl °g( cB / R2/d ) 



Now substitute the values of the parameters to get (1481) . 

Next, (149p follows from Theorem [T9l in the same way as (148j) . 

Finally, to prove (j4"Tj) . let a > be some small positive number such that 
a < ((d — 2)/dn c i) 1 / 2 (another condition will be imposed on it shortly). Note 
that 

U(x) =U(x\ B(x, aVB)) + U(x | R d \ B(x, aVB)) , 

and that U(x \ B(x,aVB)) < ^z^a 2 ® < f (see eq. (125)1 ). so that on Q ± 
we have that 

U(x | R d \B(x,a\fB)) 



max 

x£Q(0,2R) 



2 



By a similar argument to that used in the proof of ( 14811 above, the probability 
of this is bounded by CR d e~ ca,d 2Bd/2lo s( c /a 2 ) _ jj? Q wag ^Qggn sufficiently small 
this gives the bound ( )4Tjl upon substituting the values of the parameters. □ 

Lemma 23. On £/ie eueni fll D f^, i/ £/iere exists a gravitational flow curve 
r connecting dQ(0,R) and dQ(0,2R) (that is, if Er occurred), and if R is 
large enough, then there exists a connectible family W C S of bad points, 
with \W\ > R/I0 d r. 

Proof. Let r : [0,T] -> M d be a flow curve that connects <9Q(0, -R) and 
dQ(0, 2R), and assume that fl± Uf^ did not occur. In particular, U(T(0)) < 
B. Observe that the potential U decreases along the curve T, since F(x) = 
—VU(x), and therefore 

jU(T(t)) = <f(t), W(r(t))> = (F(T(t)),-F(T(t))} = -\F(T(t))\ 2 - 

Let W be the set of points w e S such that T intersects both dQ- in {w) 
and dQ ont (w). Since T connects dQ(0,R) and dQ(0,2R), clearly we have 
that \W'\ > i?/r — 2 (the —2 is to account for boundary effects). 
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Let T x G [0, T] be the least time for which U(T(Ti)) < -B, or let T X = T 
if no such time exists (see Figure [H])- Certainly, all the points w G S for 
which IY ^ intersects both dQ- in {w) and dQ ont (vj) are bad (since, because 

f^2 occurred, they satisfy condition (1) in the definition). If there are R/10 d r 
such points, we are done, since the set of such w is connected and a fortiori 
connectible. If this is not so, denote by W" the set of those w G W for 
which Ti intersects both dQ- m (w) and dQ ont (w). The family W" is a 

I [0.?i] 

connected set, and we have \W"\ > \W'\ — R/10 d r — A d > R/2r (for R large; 
the A d is again to account for boundary effects near F(Ti)). For each w G W" 
let denote some segment of Ti that connects dQ in (w) with dQ out (w) 

(possibly in the opposite direction) and that is contained in the interior of 
Qout(w) \ Qm{w) except for its endpoints. Note that the segments (T w ) we w" 
are not necessarily disjoint. Replace W" by a subset W" C W" such that 
the interiors of (<5out(w))w)ew / '" are disjoint (and therefore also (T w ) wE w"' are 
disjoint except possibly for their endpoints) and \ W"'\ > \W"\/5 d . This can 
be done using a greedy method, since each point w G W" added to W" 
eliminates at most 5 d others. 

Let k denote the number of w G W" which are not bad. Then 

2B > U(T(0)) - C/(r(Ti)) 

10 d Br 



/ \F{x)\ ■ \dx\ > / \F(x)\ ■ \dx\ > k 



R 



|[0,Tu 



This gives that k < 2R/10 r, and therefore that the number of bad w G W" 
is > \W"'\ - 2R/Wr > \\W"'\ > R/Wr. 

Let W be the set of bad w G W". Then \W\ > \\W"'\ > ^s\W"\. Since 
W" is connected, it follows that W is connectible, so it satisfies the claim of 
the lemma. □ 

In the next subsection we prove the following theorem. 

Theorem 24. There exist constants C, c > such that for any family W C S 
we have 



¥\Q C 3 n {all w G W are badjj < C e - c|H/|logi? . 



(50) 



Before turning to the proof of Theorem [2H here's how to prove Theorem 
E] using it. 
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Figure 6: Schematic illustration of the proof of Lemma 

Proof of Theorem [3 Let E be the set of connectible families W C S with 
\W\ > R/10 d r. By Lemmas [201 [23 and [23] we have that 

JP(E R ) < p(ni) + p(n 2 ) + p(n 3 ) + n n ^ n n c 3 ) 

< Ce~ cRl+T ^ + ^Pm^njallwe Warebad}) 

< ^-c^+OT + £ ^ Pm° n {all w EW are bad}") 

r \ O / 1 niJ.. TJT/ *- V I TJ7I — r \ / 



LXR/10 d r We£, |W|=L 



L>R/W d r 
q( e -cR(lo S ,Ry-*/ d \ 



□ 
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9.2 Bounding the probability for a collection of points 
to be bad 



Fix a family W C S. Our goal is to prove the inequality ([50]) . First, note that 
we may assume without loss of generality that the set W is 12r-separated in 
the infinity-norm; that is, that for any w,w' G W with w ^ w' we have that 
\\w — w'Woo > 12r. Otherwise, replace W with a 12r-separated subset of it 
which has cardinality > |iy|/25 d (as in the proof of Lemma [231 above) . and 
prove (1501) for that subset. Throughout this subsection, we assume W C S 
is a 12r-separated family. 

The next lemma can be deduced easily from the Besicovich covering 
lemma (see [13]). For completeness we include a short direct proof. 



Lemma 25. Given a set of N balls (_B(o;;, r;))^, where r\ > 1 for alii and 
\xi — Xj\ > 1 for all i,j, there exists a subset (B(xi j , r^))!^ of pairwise- 
disjoint balls satisfying 

m 

>I > " v. (si) 

Proof. Assume that the radii are arranged in decreasing order. Construct 
the subsequence (xj.)j sequentially as follows: i\ = 1, and if we defined 
Xi i: Xi 2 , . . . ,Xi t , take i t +i to be the least index i > i t such that the ball 
B(xi,ri) is disjoint from U t j =1 B(xi j , r^), or, if there is no such i, set m = t 
and terminate. In the last step t = m, because the radii are decreasing, the 
fact that there was no index i satisfying the requirements implies that the 
set U 7 J l = iB(xi j ,2r i .) contains all the points Xj. Therefore, since > 1 for all 
i, we have that 

N m 

\jB{x l M2)c{jB{ 



5r v 



The balls (B(x h 1/2))^ are pairwise disjoint, so comparing the volumes of 
both sides we get 



~> d r d 



n d 2- d N < n d J2^° 
which finishes the proof. □ 
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For each w G S, define an event 

( B ~i 

fl 4}W = { max U(x\B(x,3R 1/d )\B(x,3r)) > —\ 

u( max DiF(rc|B(a:,3i2 1/d ) \B(x,3r))\ > —?=-}. 

lxeQ(w,9r) !, \ ~ AVdp) 

Define a random set (depending on the fixed family W) 

W = jtu G : ^4 jU , occurred | . 
Define an event (again depending on W) 

^ = {|W |>i|^|}. 

Lemma 26. Denote 5 — For some constants C, c > depending only 
on d we have that 

F(n 5 ,w) < Ce~ c \ w \ RS 

Proof. 

= p(b subcollection W CW with \W'\ > \W\/2 and p| fi 4 ,™ holds ) 



tuew 



< 2 |H/| max P( Pi ft 4lu J. 

wcw, \w\>\w\/2 VJ e ^, ' / 

Therefore it's enough to prove that for some constants C, c > 0, for any 
subcollection W C W we have 

P( fl < Ce~W. (52) 



Fix a subcollection W' C W. Denote a = 2 o(d 2 +i) - Define a finite sequence 
of scales 

R l / d = 1 1 >1 2 >1 3 > ...>l K = r 
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where we do not care about the precise values of the ij and only require that 
for each i we have 

k + i > ^ (53) 

and that K is a constant depending only on d; for example, it is possible to 
define such /; with K = 40d. 

For each w G W and each 1 < % < K define the event 



At, = <, max 

x£Q(w,3r) 



U(x\B(x,3li)\B(x,3l i+ i)) 



> 



B 



Ui max 

x£Q(w,3r) 



D 1 F(x\B(x,3l i )\B(x,3l i+1 )) 



> 



A{K-l)Vdp> 

We have, using Theorem dH that for some constants C2, C3 > the estimate 
HK) < Cr d exp (- Cl fc 2 £log ) + O d exp (- Cl l d +1 £ -log(^ 

holds. Using flo^l) and substituting the values B = R 8 ^ 9 , r = A • (logi?) 2 ^ 
and £ - = R d /^ d2+1 ^\ogR we obtain 

P(4,) < Cexp(-c 4 /f- 2 i? 8/9 - a(ci - 2) )+Cexp(-C4/fi? d/10{d2+1) - da ) 

< C exp (-cjf jR s/9-°(d-2)-2/d) + c exp(-c 4 /f i? 5 ) 

< Cexp (-cjfR 5 ) . (54) 

Note also that C UHi^u-- Therefore 

pf n ^ p f n ik) = p ( u n ^ 

VtueW" / i=l / V i:W'-+{l,2,...,K-l} weW 



< K\ w '\ max P ( n A*")) 



(55) 



Fix a function i : W — > {1, 2, . . . , X — 1}. We extract from the family of 

events < r a subfamily < \ of independent events, by using 
I J weW' I J weW" 

Lemma [25j By the definition of A l w , such a subfamily will be independent 
if the balls ( B(w, 7\fdli( w \) ) are disjoint. By Lemma [251 we can obtain 
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such a subfamily with Ylw&w" ^t(w) — (42v^) d |W'|. This gives, continuing 
(155]) and using (154) . that 



< Cexp ^-co- (42v / d)~ d - |W|i2*J. 

This proves (152]) and finishes the proof of the lemma. □ 
For each w G S introduce two subgrids of points 

S w = sZn{Q{w,2r)\Q{w,r)), 
T w = pZn(Q(w,2r)\Q(w,r)). 

For w G S, two subgrid points x, x' G S w are called adjacent if \x— x'\ = s. 
A chain is a sequence of points such that each two consecutive points are ad- 
jacent. A point x G S w is called an inner point of S w if dist(x, dQ(w, r)) < 
s, and it is called an outer point of S w if dist(x, dQ(w, 2r)) < s. 

Definition 27. A point x E~R d is called a-crowded if Q(x,as) contains a 
star. A point w G S is called percolating if there exists a chain of distinct 
points (xi) <i<k C S w such that x is an inner point of S W! Xk is an outer 
point of S w , and at least a 9 /10- fraction of the Xi's are 8-crowded. 

With W C S as above a 12r-separated family, define a random set (again 
depending on W) 

Wi = ju> G W : w is percolating j. 

Define an event 

fi 6 ^ = {|W 1 |>i|^|}. 

Lemma 28. For some constants C,c,a > depending only on d we have 
that 

P(^W) < Ce~ c \ w \ Ra . 
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Proof. For any x E M we have for some constant C\ > that 

F(x 8-crowded) = 1 - e~ (16s)d < Cl s d . 

Fix a w E S. For each k, the number of chains of distinct points (£i)o<i<fc C 
S w such that x is an inner point of S w and is an outer point of S w is at 
most 2d- (2r/s) d ^ 1 ■ {2d) k . Note that such chains can only exist if k > r/s, so 
in particular, the number of such chains is < C k for some constant C2 > 0. 
For each such chain C, there are < 2 k subsets of it of size at least 9/c/10. 
Given such a chain C and such a subset C of it, one may choose using 
a greedy method (as in Lemma [23] a further subset (a^)o<j<fc' of C with 
k' > 9k/(10 ■ 33 d ), such that for each i ^ j we have that ||a^ — o^||oo > 16s. 
Therefore the events [{x[ is 8-crowded}) 0<i<k , are independent. Therefore 
we have the bound 

F(w percolating) < ^ 2 k C k (c lS d ) ^ d < C 3 (C iS d ) { s > < C 6 e~ C7fla 

for some C3, C4, C5, Cq, cj, a > and all i? large enough. Now, because of the 
assumptions that the points of W are 12r-separated, the events ({w percolating}) 
are independent. Therefore 

p(Q 6iW r) = P^3 subcollection W CW with |W'| > \ W\/2 

and all w; G W 7 ' are percolating J 

□ 

Definition 29. G S, a point x G T^, zs called black if there exists a point 
y G Q(x,2p) suc/i i/m* either \F(y)\ < ™^ or C/(|/ | B(y^R l / d )) < -f . 

Lemma 30. a S is bad and not percolating, then the subgridT w contains 
at least 1Q Q dp points which are black and not 4-crowded. 

Proof. Let w G S be bad and not percolating. Let 7 be a gravitational 
flow curve connecting dQ(w,r) with dQ(w,2r) such that at least one of 
the conditions (1), (2) in Definition I2T1 holds. Let X be the set of points 
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x ET W such that some segment of 7 crosses from dQ(x, p) to dQ(x, 2p). Let 
J be the set of points x G S w such that some segment of 7 crosses from 
dQ{x,s) to dQ(x,2s). Note that | J\ > z - 2 > ^ (with the rightmost 
inequality holding for R sufficiently large), and for each x G J we have that 
I Tfl (Q(x, 2s)\Q(x, s))\ > £ — 2 > — , with the rightmost inequality holding 
for sufficiently large. 

First, we prove that the number v of x G X which are not black is at 
most wo r n d p - If condition (1) in Definition I2~T1 holds for 7, then for all x G X, 
for some 2 G Q(x,2p) \ Q(x,p) which is in the range of 7 we have that 
U(z I BfaZR 1 /*)) < -B/2, so x is black. So we have shown that if condition 
(1) in Definition I2T1 holds, all x G X are black. The other possibility is that 
condition (2) in Definition |2~H holds for 7. In that case, denote by 7' the union 
of those segments of 7 crossing from dQ(x,p) to dQ(x, 2p) for those x G X 
which are not black. It is not difficult to see that len(7 ; ) > pj^ (this uses a 
similar argument to the ones used at the beginning of this subsection and in 
the proofs of Lemmas [23] EH])- So, because of the definition of blackness we 
have that 

10 d Br f f 10000^5 

> / \F{x)\ ■ \dx\ > / \F(x)\ ■ \dx\ > len(V) 

J-y J-yi R 

10000^ p ■ v 



R 



> 



R 13 d 



and therefore v < , nn r ,, d , as claimed. 

Next, because of the assumption that w is not percolating, at least a 
1/10-fraction of the points x G J are not 8-crowded (minus 2 to account 
for boundary effects), or in absolute terms at least points in J . As in 
the proof of Lemma [28] above, we can choose a further subset J 1 C J of 
those points of J which are not 8-crowded which is 5s-separated and such 
that \J'\ > io'jjX - By the remark in the first paragraph of the proof, for 
each such x G J' we have at least — points y G X R (Q(x, 2s) \ Q(x, s)), 
and these y are not 4-crowded. That gives a total of at least 10 °f 1 d points 
y G X which are not 4-crowded, and these points are all distinct because J7' 
is 5s-separated. Since as we proved above at most 100 4 1 d p of them are not 
black, it follows that there are at least l0 Qd p points y G X which are black 
and not 4-crowded, as claimed. □ 
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Let W C S be a 12r-separated family. Denote 



n 



P| jat least points x E T w are black and not 4-crowdedj. 



Lemma 31. For some constants C,c> 0, we have 

P(£l 7 ,w) < Ce- clwllogR . 

Proof. Let N = \W\ < \S\ < (4Pt/r) d , and let W = {w u w 2 , . . . , w N } be 
some arbitrary ordering of the points of W, and define a random variable 



N 

X w = #{(^1, x 2 , ■ ■ ■ , xn) e JJ T u 

3=1 



Then, by Markov's inequality, 



are all black and not 4-crowded|. 



< 



100 d p 



100 d p 



-N 



100 d p 



N 



N 



n 



Pf p) n VL c A w . n {xj black, not 4-crowded}) 



(^i,..,x w )en"i^, i =1 



-AT 



E 



(xi,...,xjv)en^i^ 



iV 



il 

j=i 



(56) 



where A,-^ = Q.% n H {xj is black and not 4-crowded}. Denote A = 
2(d/ K d ) l l d (\ogR) l l d . Fixing (#i, . . . , %) for the moment to simplify the no- 
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tation, for 1 < j < N denote 



max 

x£Q(xj,2p) 



B 

> 4 



U(x | B(x,3r)\B(x,3s)) 
D x F(x | B(x, 3r) \ B(x, 3s)) 
no star in -B(xj, A) >, 



U< max 



> 



lQ^/dp 



A" 



\F(xj)\ < e, some stars in B(xj, A) 



We claim that 



Aj t xj C Aj U 



(57) 

Here is the proof: Assume Aj jXj holds. If for some y G Q(xj,2p) we have 
that \F(y)\ < 1000 ® dB ; which for i? sufficiently large implies \F(y)\ < ^, then: 
Either A'- occurred, or, if not, then from the definitions of 4-crowdedness and 
of the events f^, fl^ Wj , {A'-) c it follows that for R large enough, 



max \DiF(x)\ < max DiF(x \ B(x,3s)) 

x£Q(xj,2p) xeQ(xj,2p) 



+ max 

x£Q(xj,2p) 

+ max 

x£Q(xj,2p) 

+ max 

x£Q(xj,2p) 



DxF{x | B(x, 3r) \ B(x, 3s)) 
DiF(x | ^(a:, 3i? 1/d ) \5(x,3r)) 
L>iF(x | R d \£(x,3.R 1/ ' i )) 



e e 15e 

+ 7=- + — < 



IftVdp AVdp 8Vdp 32Vdp 



(the term y/d,Kd comes from eq. ( |30j) . note that it is << e/p for large R). 
Therefore \F( Xj ) \ < ^ + 2pVd ■ 



15e 



— =- — e, and since A\ did not occur there 



are stars in B(xj, A) and therefore A" occurred. 

The other possibility by blackness of Xj is that for some y G Q(xj,2p) 

we have that u(y B{y,3R 1 ^ d )^ < — -|. Then, because of ^% iWj we also 
have that u(y B(y,3r)^j < — -j, and because Xj is not 4-crowded we can 
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write equivalently Uyy B(y,3r) \ B(y,3s)^j < — j, so A'- occurred. This 

completes the proof of (iBTj) . 

Note that each A 1 - is measurable with respect to the locations of the stars 
in Q(wj,5.5r). Therefore, because the Wj are 12r-separated, {A'-)j is an 
independent family of events. By Theorem [19] we have for each j that 



PK-) < Ce~ csd ~ 2mog ^ +Ce 



-cs"~ log 



+ e 



1 



< — (for R sufficiently large.) 

Also, for any integers 1 < b\ < 62 <...<&*< iV, by Theorem [10] we have 
that almost surely 



where J r b 1 ,...,b i is the cr-algebra generated by the locations of the stars in 
\B(xb 17 A) U B(xb 2 , A) U . . . U B{xb i i A)l , provided that the assumptions of 
that theorem hold; because of the values chosen for the parameters r and A, 
this is true if A is chosen to be a sufficiently large constant. This is the only 
place where the value of A is important. So we can write, for sufficiently 
large R, 



E 



JV 



IT 



b=i 



< E 



JV 



n + 1 



< 



< 



< 



< 



E 

i=0 



JV 

yi ) ^ r . max pfAf n...ni" 



i=0 



N 

E 



j=0 



max 

01 < . . . < ajv- 
61 < . . . < 6j 
Vj,£ aj +b t 



P(<n...n^n<n...n^) 



61 < . . . < bi 



i J R d ( N ~*) 



R d 



R~ 



10(^ + 1) 20(d / + l) 



JV 



n A' 



ajv- 



< ( 2R 10 10(d 2 + l) 20(^ + 1) 



N 



(58) 
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(since for large R the polylogarithmic factor in e d CX d2 d can be bounded by 
i 

R 2 °( d ' 2+1 )). Now ( 1581) gives, using ( 1561) . that for sufficiently large i? we have 

/v 



7,Wj 



< 



< 



N / \ dN , 

T \ I d i d l 

2R 10 10(<* 2 + 1) 20(d 2 +l) 



100 d p 

CV^ -1 /?^ ( 2R~ Tr, ~ r " r 20(^+1) 



-i TV 



□ 

The only step remaining to complete the proof of Theorem [3] in dimensions 
5 and higher is the following. 



Proof of Theorem\24} As noted above, we assume that W is a 12r-separated 
family. For an event A denote F'(A) = ¥(A fl fig). The idea of the proof is 
roughly as follows. Because of Lemma [261 we ma Y replace W by a subset W 
so that \W'\ > \W\/2 and such that C\ W £W'£l\ w occurred. Because of Lemma 
fZE\ we may replace W by a further subset W" C W so that \W"\ > \W'\/2 
and all w G W" are not percolating. Finally, by Lemmas [30] and [311 the 
probability that all w e W" are bad is < C e - c|H/|logi? . 
Formally, we have 



P' ( P| {w bad} j < P'(0 5 ,w) + P' ( ^5,^ n fl { w bad 



+ e p/ f n O bad } 



4,to 



|W| > \W\/2 



< Ce -c|H/|^ +2 | W | max 

W c w, 



r n ({ 



w bad ^ fl fl 



4,u> 



> |W[/2 



For any W' we have 



" n ({- 



PI r l Nwbad!>n^ 



.(59) 
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< r (n 6iW ,) + r n c 6W , n fl (j™ bad} n ni 

+ ^2 p/ ( Pi {{ w bad ' not p erc ° iatin g} n q 

w" c W, \weW" 



4,w 



\W"\ > \W'\/2 

< Ce~ clw ' lRa 



+2 lw ' 1 max | P| ( jii; bad, not percolating} n fi^) J .(60) 



w" c w 

> \W'\/2 



\wew" 

For any W" we have, by Lemmas 



W p ({w bad, not percolating} nO^) < Ce- clw " llogR . (61) 



Combining §HD, 0, dSI]), and remembering that |W| > |W|/2 > |Vl^|/4, 
we get 



"fpi {wbad}^ <C e -^U° g « 



This completes the proof. □ 



10 Proof of Theorem M in dimensions 3 and 
4 

In this section, we prove Theorem [3] in dimensions 3 and 4. Much of the proof 
in dimensions 5 and higher remains unchanged. However, new large devia- 
tions estimates are required, as well as the introduction of a new function, 
the potential difference function. 

10.1 The potential difference function 

First, it is instructive to understand why the proof in Section [9] fails in 
dimensions 3 and 4. The difficulty is that the stationary potential function 
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U(x) cannot be denned. This can be seen from equation (I24p : in dimension 
3 the variance of U p±q diverges like a constant times p as p — > oo, and in 
dimension 4 like a constant times log p. 

However, the proof in Section [9] for the most part does not use the full 
stationary potential. After discarding the atypical events Q\,Q 2 , only the 
partial potential U(x \ A) is used for various sets A C B(x,3RV d ). So, to 
adapt the proof to dimensions 3 and 4, we replace these events with suitable 
adaptations of them. 

Assume for the rest of this section that d = 3 or 4. Define the potential 
difference function U diS : R d x R d -> R by 

U**( XiV ) = *Y ( , - V-O^-H 2 ) (62) 

v ,yj d-2^\Zi-y d - 2 \ Zi -x\ d - 2 2 Vl 1 |y| ; y ' 

where the sum is in order of increasing | Zi\ . We need to check that this sum 
converges a.s. This is true because, defining 

U%{*,V)= E (Trf^-TT^W). (63) 



\Zi\ < P, 
\zi —x\ > q, \zi -y\> q 



y\" * \Zi - x\ 



it is easy to check as in Section [7| that if \y\ < p — q and \x — y\ > 2q then 

nu*?(x,y)) = I (\z^^-\z-^^) dz 

J B(0,p)\B(x,q)\B(y,q) \\ Z ~ V\ \ Z ~ X \ J 

= ^^(M 2 -N 2 ), 



Var 



U%*{x,y)-U$*{x,y) 



O 



P 



d-2 



P 



■d-2 ] ' 



P,p 



OO. 



Similarly, using the methods of Section it is not difficult to prove the 
following. 



Lemma 32. The series (16"21) converges a.s. simultaneously for all x,y E 
R d \ {zi}i, and defines a centered process that a.s. is differentiable where it 
is defined and satisfies V x U dlff (x 1 y) = F(x), V y U dlff {x 1 y) = —F(y). 
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Remark. The potential difference can in fact be defined for all dimensions 
d > 3, and for dimensions d>5we have that U dlS (x,y) = U(y) — U(x). 

For a bounded set AcK^, denote 



U m (x,y | A) 



d-2 



Zi eA 



d-2 



-1 


-1 


Zi - y\ d ~ 2 


\Zi — x\ d ~ 2 


-l 


-1 


\z — y \ d ~ 2 


\z — X \ d ~ 2 



dz. 



For a set A C M d whose complement is bounded, denote 



£/ dltt (* )2/ | A) = f/ dltt (* )2/ ) - U dl \x,y | M d \ A). 

Again, it can be verified that if A C M d is an annulus of the form B(v,p) \ 
B(v, q), where v G M. d and < q < p < oo, then 

V x U dm (x,y | A) = F(x | A), V y U diS (x,y \ A) = -F(y | A). 

10.2 Large deviations estimates in dimensions 3 and 4 

The large deviations estimates which we prove in this subsection will com- 
plement the estimates in Section [HI 

Theorem 33. In dimension d = 4, there exist constants Ci, 02,03 > such 
that for all x, y G M. d and p > q > 2 satisfying p > 3q, \x\, \y\ < p/2, and 
\x — y\ > 3q, we have that 



P 



U dlff (x lV I B(0,p) \ (B(x,q) U B(y,q))) > t) < de' 02 ^ 05 ^ (64) 



for all t above a threshold that depends on q and \x — y\, as follows: 

t> Ciq 2 and 
t>C iq 2 \og (^) log(i 



\y-y\ 
q 



(65) 



Similarly, in dimension d = 3, there exist constants Ci, 02,03 > such that 
for all x, y G M. d , p > q > 2 and t > satisfying p > 3q, \x\, \y\ < p/2, and 
2 < q < t < g|sc — y\ we have that 



P 



U d ^(x,y I B(0,p)\(B(x,q)UB(y,q))) 



> t < do W- 



(66) 
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Proof. The proof is modeled after the proof of Theorem (THl Fix x,y,p and 
q. Denote B = B(0, p) \ B(x, q) \ B(y, q). Let W be a uniform random point 
in B, let iV be the number of stars in B, and let 

i: ^eB V 1 y 1 1 1 7 



diflf 



as defined in flggp, so that £/ diff (x, 7/ | B(0,p)\(B(x, q)UB(y, q))) = ^(U di 
E[U diS }). Let 

y= |W _ x |-(<*-2) _ |^_ y |-(d-2). 

Then for any u > we have 

wi>«> < p ( r ^>«)+ p ( r ^>« 

- Vo\{B)u d K d - 2 ) ' (6?) 

Suppose now d = 4. Noting that \V\ < 2/q 2 , for any integer k > 3 we have 
exactly as in f[3~4"|) that for some constant C-j > 0, 

2/q 2 n 



E(|l/| fc ) = / fat^PflFl > u)du < 
Jo 



Vo\(B)q 2k - 



Evidently, we need a better tail bound for \V\ to get anything useful for 
k = 2. To that end, note that 



\V\ 



\\W-x\ 


2 ~\W-y\ 2 \ 


- < - 


x-y\(\W 


— x\ 


+ \W-y\) 


\W -x 


|2 . \W-y\ 2 


\W-x\ 


2 ■ \W-y\ 2 



x -y\\ TF77 1 — TF77 i? + 



\W-x\-\W-y\ 2 \W-y\-\W-x\ 

Now, if \W — x\ and \W — y\ are both bigger than (2\x — yl/u) 1 ^ 3 , then a 
simple verification using the above inequality shows that \V\ < u. Thus, 



P(|V| >u) < P 



V V u 
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Combining the bounds from (157)) and (1551) and using the assumption that 
\x — y\ > 3q, we get that 



\*-y\ 
q 



1 1 -y Vol(5K/ 3 4^1-, Vo1(S)m 2 - Vol(fl) 

Thus, we have for any 9 > that 

^) * -^^ + ^g^ 

C 9 log f ^) r „4 

- v ; 2Vol(S) Vol(S) 

Now proceed as in the proof of Theorem [I5J Conditionally on N, the stars 
in B are a vector of N i.i.d points distributed uniformly in B, and therefore 

This implies as before that 

E (eW^-W^ < exp Qc 9 fl 2 log (^pH + C^^, 
whence, for any t > and 6* > we have 

p (u Am - E[u Am ] >t)<ex P (^-et+ l -c 9 e 2 log {^Y^j + c 7 gV/<^ . 

Take 6 1 = q 2 \og(t / Cjq 2 ) . Then, if we assume (1551) for some sufficiently large 
constant Ci > 0, we get that 

1 C^^O^] K 01 



2 °\ g y - 2 ' 

and that therefore 

P (f/diff _ E [U diS } >t) < exp(^-j + C 7 q 4 e e/q2 ^j 
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In a similar way, one obtains the bound for the negative tail, and this con- 
cludes the proof of (IMj) and the case d = 4. 

Turn now to the case d = 3. From fIBTj) and the fact that V < 2/q 2 we 
get as above that for some (77 > we have for any integer k > 4 that 

,2/g 2 



E(|V| 



ku k - L ¥(\V\ > u)du < 



Cr 



Vo\(B)q k - 3 ' 

To get useful bounds for k = 2 and k = 3, observe that 



1 1 




\W — x\ — \W — y\ 






\W - x | • \W - y\ 



\v\ 

Therefore 

PflVI >u) < P I \W-x\ < 



< 





X 




y 




\w- 


X 




W-y\ 



\x - y\ 



u 



1/2" 



+¥[\W-y\< 



\x - y\ 



u 



1/2" 



< a 



\x-y 



3/2 



Vo\(B)u 3 / 2 ' 



This gives that 

E(\V\ 3 ) < 



\*-y\-\ 2 C 8 \x-y\ 3 / 2 



3u 



Vol(B)u 3 / 2 



du 



2q-* 
\x-y\~ 1 



3n 2 „ 2K ± ndu< C9log(\x-y\/g) 



Vo\(B)u 3 



Vol (B) 



and similarly, 

E(\V\ 2 ) 



< 



\x-y\ 1 q 

2u 



\x-y 



3/2 



■du 



+ 



Vo\{B)u 3 / 2 
jU Vol(5) U 3 " Vol(S) 



Combining these bounds and proceeding with the same technique as above, 
we deduce that for any t > and 9 > we have 



C 



F ([ / ditt _ E ^d iU ] > t ) < exp ( _ &t+ ^e 2 \x -y\ + —9*101 

c _ 

fc=4 



9/)3- 



f ~ y\ 

Q 

(9/qT 
k\ , 
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Take 9 = r^-r, where A is a constant such that < A < in ,„ . . J\ .„ . , . 

\x—y\ ' 10(C7VC9VCioVl) 

From the assumptions 2<q<t<\x — y|, we get 

At 2 



9t 



C 10 9 2 \x-y\ 



\x — y\ 
C w A 2 t 2 



C 9 9 3 log 



29t 

\%-y\ 



o I /J - 0^10^4 < — , 

2\x — y\9t 2 10 



6 



9t 



C 9 AH\og 
6\x — y\ 2 



< CaA 2 < 



1 

Io : 



and similarly 



3 V^oo (S/q) k 



Cr 



9t 



=4 k\ < w 7 „3 



24#t 



C 7 # 3 1 C 7 A 3 . 1 



fc=4 



24gt 1 



< 



12 ~ 10 



(note that 9/q < 1/2). Therefore we get 



F(U am - E[U am ] > t) < exp ( -9t+ —9t 



exp 



-7 At 2 
10\x — y\ 



The bound for the negative tail is obtained similarly. This completes the 
proof of (JHSD- □ 



Corollary 34. In dimension d = A, there exist constants Ci, C2, C3 > such 
that for all x,y G M. d and p > q > 2 satisfying p > 3q, \x\, \y\ < p/2, and 
\x — y\ > 3q, we have that 



1 i max 

ueB(x,l),v&B(y,l) 



U dtff (u,v I B(0,p)\(B(u,q)UB(v,q))) 



> t 



< C x e 



-C2q 2 t log( ^ 



for all t that satisfies ( 1651) . Similarly, in dimension d = 3,there exist con- 
stants Ci, C2, C3 > such that for all x, y G M. d , p > q > 2 and t > satisfying 
p > 3q, \x\, \y\ < p/2, and 2 < q < t < \x — y\ we have that 



IP I max 

ueB(x,l),veB(y,l) 



U diff (u,v I B(0,p)\(B(u,q)UB(v,q))) 



> t 
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Proof. This follows from Theorem [32] in the same way that Theorem [Tj|] 
follows from Theorem (THJ We omit the proof. □ 

We also need large deviations estimates for the truncated potential func- 
tion. This differs from our estimates in dimensions 5 and higher in that the 
estimates are valid only in a restricted range of the parameters, depending 
on the dimension. 

Theorem 35. There exist constants C%, C2, C3 > such that for all p > q > 
we have that 

F(\u(0 I B(0,p)\B(0,q)) \ >t) < C ie ~ C2qd ~ 2n ° g ^) (69) 

for all t above a threshold that depends on d, p and q, as follows: 
t> Ciq 2 and 

t > C\q 2 log (^r^j log dimension d = 4; ' ' ' ' 

t > Ciq 2 and 

t>C ig 2 (log(^)) 2 log(£) and (71) 
t > C\pq log {Or \ in dimension d = 3. 

Proof. Let d = 4. In the notation of Theorem (THl we now have that 
holds only for k > 3. For k = 2 we have 



Now proceed exactly as in the proof of Theorem [221 above. 

For d = 3, we have, still in the notation of Theorem [THJ that (1341) holds 
only for k > 4. For = 2 we have 

1 1 1 p ' ql Vo\{B)J q Vo\{B) -Vol(S)' 

and similarly for fc = 3 we have 

62 



Proceeding as in the proofs above, this leads to the inequality 
P (U p , q - EU M >t)< exp f-6t + C 2Q 9 2 p + C 30 9 3 log (^j + c^V^M 

valid for all t > and 9 > 0. Taking 9 = qlog Cc~~^s\i t ms easily gives the 
positive-tail half of (169]) under the assumptions (1711) . As before the negative- 
tail half is proved similarly. □ 

Theorem 36. There exist constants C±, c 2 , c 3 , C4 > such that for all p > 
q > we have that, if t > C^p 2 , and if the same assumptions (1701) and (1711) 
as in Theorem hold, then we have 



P max 



.X6B(0,1A| 



(x I B(x, p) \ B(x, q)) \>tj< C 1 e~ C2qd ' 2tl ° g ^ . (72) 

Proof. This follows from Theorem [351 in the same way that Theorem HH] 
follows from Theorem [161 D 



10.3 Dimension 4 



Let B, A,r, p, s,e be the same as in Section 19.11 We redefine the events 
Qi,fi,2, as follows. 

= ( max U(x I B(x, 3R l/d )) > -), 



EQ(0,2R) 

^1 = n«un 



( max ( # of stars in Si? 1 ^)") > r > . 



max 

x,y e Q(0,2_R), 
I a: - y| > fl/100 



U m (x,y 



l \{B(x, 3R 1/d )UB(y, 3R 1/d )) 



> 2Y 



Let Q3 remain the same as in Section 19.11 The following lemma replaces 
Lemma [221 

Lemma 37. In dimension 4, for some constants C, c > we have for all 
R>2 that 

P(fii) < Ce- Cfll7/9 , (73) 

P(fi 2 ) < Ce- Cfln/9 , (74) 

P(fi 3 ) < Ce- cfil+1/10 ° d2 . (75) 
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Proof. First, note that = for R sufficiently large, since the only positive 
contribution to U(x | B(x, 3i? 1//d )) comes from its expected value (Qdn d /2)R 2 l d 
(see d23J). 

Next, to estimate the probability of Q**, cover Q(0, 2i? + m 1 ^) with 
0( J R d " 1 ) balls of radius Gi? 1 ^ so that for each x e Q(0, 2i2), the ball B(x, 3R}/ d ) 
is contained in one of them. In each of these balls, we need to estimate the 
probability that a Poisson random variable with mean C-R (for some constant 
C > 0) is > BR/ (10 • 100 d ). By Lemma H this probability is O (e"^ 10 ^) 
for some constant c > 0. 

Finally, the estimate for the probability of Q2 follows from Corollary [M] 
in the same way that f)48p follows from Theorem [THJ □ 



With these new definitions, the only further change required in the proof 
of Theorem 3 is the following revised proof of Lemma [231 All the other proofs 
remain correct as written, with appeals to (1431) being replaced by using (!72l) 
instead (one has to verify that the conditions under which ( 1721) may be used 
actually hold, but this is easy). 

Proof of Lemma \2^h in dimension 4- Let T : [0, T] — >• M. d be a gravitational 
flow curve that connects dQ(0, R) and dQ(0, 2R), and assume that Q1UQ2 did 
not occur. The potential difference U dlS (T(0),T(t)) decreases as a function 
of t. Let T = sup {t E [0,T] : \T(t) - T(0)\ < Let W be the set of 

points w G S such that T\ intersects both dQ- in {w) and dQ ont (w). Since 

\[To,T] 

r connects dQ(0, R) and dQ(0, 2R), we have that \W'\ > g^/lOOr^ (again 
the —2 is to account for boundary effects). 

Let Ti = sup{t G [T ,T] : U d[S (T(0), T(t)) > -2B}. Now, if x = T(t) 
for some t > T±, then by the definition of T\ we have 



U dm (T(0),x) < -25, 

and by VL^ we have 

f/ diff (T(0), x M d \ (B(r(0), 3R 1/d ) U 5(s, 3i? 1/d )) 
Therefore also 



u diS ( r(o), x 5(r(o), 3# 1/d ) u 3i? 1/d ) 



< 



B 

< —. 

~ 2 
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U m (r{0),x B(T(0),3R 1/d )U B(x,3R 



B(x, 3R 1/d )) - U(T(0) B(r(0), 3.R 1 



But 

U dm (] 
= U\x 
+u(x 
and by fll we have that 

-u(r(p) B(r(o),3i? 1/d )) > 



l/<^^ 



B(r(o),3i? 1/d )) -c/(r(o) 



B 

"2 



and that 



U x 



B{T{0),3R 1/d )) -U[T{0) 



B(x,3R l/d ' 



< 



BR 



1 < B 

10 ■ ioo d (i?/ioo) d - 2 - To' 



Therefore we get that, for x = T(t),t > T 1; we have 



U x 



B(x,3R l/d )) < 



9B 

'To 



< 



5 
"2 ' 



By the above, it follows that all the points w G S for which Ti intersects 

pi,T] 

both 9Q; n (w) and dQ out (w) are bad, since they satisfy condition (1) in the 
definition. If there are R/10 d r such points, we are done, since the set of such 
w is connected and a fortiori connectible. If this is not so, denote by W" the 
set of those w e W for which Y\ intersects both dQ- m {w) and dQ out (w). 

\[To,Ti] 

The family W" is a connected set, and we have \W"\ > \W'\ — R/Ar — 1 > 
74i?/100r - 4 > i*/2r. As in Section E3 replace W" by a subset W" C W 
such that | W"'\ > \W"\/h d and all the interiors of (Qont(w)) we w"' are disjoint. 
Repeating the same argument as in Section 19.11 we get that the set W of 
bad w G W" is connectible and contains > R/10 d r points. □ 



10.4 Dimension 3 

Let d = 3. All the foregoing discussion for dimension 4 remains valid, except 
the estimate (1741) . In dimension 3 we only get the weaker estimate 

P(f2 2 ) < Ce~ cR7/9 
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for some constants C,c > 0. Thus, while all the elements of the proof still 
function, what we actually proved was an upper bound for F(Er) which is 
of the form Ce~ cR?/9 . 

To get the better bound stated in Theorem [3], we modify the value of the 
parameters. Here are the new values: 



B = 


R 


QagR)P' 


r = 


(log R) 1/3 lo{ 


P = 


1 


(iog#K 




i 


s = 


(log R) s ' 


e = 


4 log log R, 


A = 


A/log logR. 



Here (3, 7, 5 are positive constants. The proof in Section [9J together with 
the adjustments of of Subsection 110.31 will work almost verbatim with these 
modified parameters, provided several conditions are met: 

• A < C ( log ^1/3 for some constant c > and all sufficiently large R. 
This is required when using Theorem [TUJ, and holds with our choice of 
parameters. 

• e > C-| for some constant C > 0. This is used in the proof of Lemma 
EH when we deduce from \F(y) \ < CB/R that in fact \F(y)\ < e/8. It 
will hold if 7 < (3 + 35, and in particular if 7 < (3. 

• ( - J e d \ d ~ d « 1. This is required when using Markov's inequality 
to ensure that the probability per site w G W to have > points 
x G T w which are black and not 3-crowded (Lemma I3TT) is << 1. This 
condition holds for any 7 > | + 95, and in particular for any 7 > | if 
5 is sufficiently small (as a function of 7). 

• — » 1. This is required in the proof of Lemma [311 to make sure that 
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With this choice of parameters, following the steps of the proof in Section [9] 
together with the changes outlined in Subsection 110.31 we get that in Lemma 
[371 the estimates (175]) and (175]) still hold. The estimate (174]) is replaced by 
the following estimate, whose proof again uses Corollary [541 

P(fia) < Ce~ c ^k^. (76) 

Lemma EH will be weakened to the following lemma, whose proof is a repe- 
tition of the same steps with the new parameter values. 

Lemma 38. For some constants C,c> 0, we have 

F{Q 7tW ) < c e - c|iy|loglogi? . 

As a result, Theorem [241 will be weakened to the following theorem. 

Theorem 39. There exist constants C, c > such that for any family W C S 
we have 

n {all w e W are badjj < c e - c|M/|loslogi? . (77) 
Therefore, the bound that we get for P(-Er) will be weakened to 

R R 

W(E R ) < P(fix) +P(fi 2 ) +P(fi 3 ) + e~ c - loglogR < Ce~ c ^v^ +Ce~ C (^v 1/3 . 

With the constraints (3 + 35 > 7 > 2/3 + 95, the best that one can do is to 
take (3 slightly bigger than 2/3. This gives that for all a > 4/3 we have 

for some constants c, C > depending on a and all R > 2. This was the 
claim of Theorem [5J in dimension 3. □ 
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